BI-FRAMELET SYSTEMS WITH FEW VANISHING MOMENTS
CHARACTERIZE BESOV SPACES 

LASSE BORUP, REMI GRIBONVAL, AND MORTEN NIELSEN* 


Abstract. We study the approximation properties of wavelet bi-frame systems in $L_p(\mathbb{R}^d)$.
For wavelet bi-frame systems the approximation spaces associated with best $m$-term
approximation are completely characterized for a certain range of smoothness parameters
limited by the number of vanishing moments of the functions in the dual frame. The
approximation spaces turn out to be essentially Besov spaces, just as in the classical
orthonormal wavelet case. We also prove that for smooth functions, the canonical
expansion in the wavelet bi-frame system is sparse and one can reach the optimal rate of
approximation by simply thresholding the canonical expansion. For twice oversampled
MRA based wavelet frames, a characterization of the associated approximation space is
proved without any restrictions given by the number of vanishing moments, but now the
canonical expansion is replaced with another linear expansion.


1. Introduction 

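Given a finite collection of functions $\Psi = \{\psi^1, \psi^2, \ldots, \psi^L\} \subset L_2(\mathbb{R}^d)$ we use the notation
$X(\Psi)$ to denote the corresponding “wavelet” system,

$$X(\Psi) := \{2^{jd/2}\psi^\ell(2^j \cdot - k) \mid j \in \mathbb{Z},\ k \in \mathbb{Z}^d,\ \ell = 1, 2, \ldots, L\}.$$

A wavelet bi-frame for $L_2(\mathbb{R}^d)$ consists of two sequences of wavelets

$$\Psi = \{\psi^1, \ldots, \psi^L\} \subset L_2(\mathbb{R}^d) \quad\text{and}\quad \tilde\Psi = \{\tilde\psi^1, \ldots, \tilde\psi^L\} \subset L_2(\mathbb{R}^d)$$

for which the systems $X(\Psi)$ and $X(\tilde\Psi)$ are Bessel systems, and satisfy the perfect
reconstruction formula

$$f = \sum_{\ell=1}^{L} \sum_{j\in\mathbb{Z},\,k\in\mathbb{Z}^d} \langle f, \tilde\psi^\ell_{j,k}\rangle\, \psi^\ell_{j,k}, \qquad \forall f \in L_2(\mathbb{R}^d), \tag{1.1}$$

where

$$\psi^\ell_{j,k} := 2^{jd/2}\psi^\ell(2^j \cdot - k), \qquad j \in \mathbb{Z},\ k \in \mathbb{Z}^d.$$

This definition implies that both $X(\Psi)$ and $X(\tilde\Psi)$ are frames for $L_2(\mathbb{R}^d)$ and in fact the
roles of $\Psi$ and $\tilde\Psi$ are interchangeable in (1.1). The special case with $\Psi = \tilde\Psi$ corresponds
to a so-called tight wavelet frame.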
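
As a purely illustrative aside (not part of the paper), the reconstruction formula (1.1) has a simple finite-dimensional analogue. The sketch below assumes the three unit "Mercedes-Benz" vectors in $\mathbb{R}^2$ as a redundant tight frame; the names `frame`, `dual` and `reconstruct` are hypothetical, and the rescaled copy `dual` plays the role of the dual system.

```python
import numpy as np

# Three unit vectors at 120-degree spacing: a redundant tight frame for R^2.
angles = np.pi / 2 + 2 * np.pi * np.arange(3) / 3
frame = np.stack([np.cos(angles), np.sin(angles)], axis=1)  # shape (3, 2)

# For this frame, sum_i <f, g_i> g_i = (3/2) f, so a dual system is
# simply the rescaled frame (the finite-dimensional stand-in for the dual wavelets).
dual = (2.0 / 3.0) * frame

def reconstruct(f):
    """Analysis with the dual system, then synthesis with the frame."""
    coeffs = dual @ f          # analysis coefficients <f, dual_i>
    return frame.T @ coeffs    # sum_i <f, dual_i> frame_i

f = np.array([0.3, -1.7])
print(np.allclose(reconstruct(f), f))
```

The three coefficients are not the only ones that reproduce `f`; this redundancy is the finite-dimensional counterpart of the non-uniqueness of frame expansions.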
The most common method to construct wavelet bi-frames relies on so-called extension
principles. The resulting bi-frames are based on a multiresolution analysis, and the
generators are often called bi-framelets. The construction of multiresolution-based wavelet
frames has been studied extensively, see e.g. [6, 8, 22, 23, 24].

In this paper we study the nonlinear approximation properties of the systems $X(\Psi)$
and $X(\tilde\Psi)$ when the approximation error is measured in $L_p(\mathbb{R}^d)$. That is, we consider the
(nonlinear) set

$$\Sigma_m(X(\Psi)) := \Big\{ \sum_{i\in\Lambda} c_i g_i \;\Big|\; c_i \in \mathbb{C},\ g_i \in X(\Psi),\ \operatorname{card}\Lambda \le m \Big\}$$

of all possible $m$-term expansions with elements from the system $X(\Psi)$. The error of
the best $m$-term approximation to an element $f \in L_p(\mathbb{R}^d)$ is then

$$\sigma_m(f, X(\Psi))_p := \inf_{f_m \in \Sigma_m(X(\Psi))} \|f - f_m\|_{L_p(\mathbb{R}^d)}.$$

A first problem (Problem 1) is to characterize the class $\mathcal{A}^\alpha$ of functions
$f \in L_p(\mathbb{R}^d)$ for which $\sigma_m(f, X(\Psi))_p = O(m^{-\alpha})$. A related problem
(Problem 2) is to characterize the class $\mathcal{K}_\tau$ of functions $f$ with expansions in $X(\Psi)$ that
are sparse in the sense that the $\ell_\tau$ norm of the expansion coefficients is finite.

In the special case where $X(\Psi)$ is indeed a (bi)orthogonal wavelet basis in $L_2(\mathbb{R}^d)$, it is
well known that when $\alpha = 1/\tau - 1/p$, the two (families of) classes $\mathcal{A}^\alpha$ and $\mathcal{K}_\tau$ essentially
coincide and are indeed Besov spaces $B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))$. The decay, smoothness, and number
of vanishing moments of the wavelet determine the range of values of $\tau$ for which the
characterization is true. We refer the reader to, e.g., [25, 2] for a definition and some properties
of the Besov spaces. When $X(\Psi)$ is a redundant frame, since the perfect reconstruction
formula (1.1) only provides one among an infinite number of possible expansions of $f$ in
the system $X(\Psi)$, one must a priori consider separately two families of “sparseness classes”
depending on whether sparseness is measured in terms of synthesis or analysis coefficients.

Some results on approximation with tight wavelet frames were obtained by the authors
in [3], but the results in the present paper are more general even for the tight wavelet
case. For oversampled tight wavelet frames based on splines, the approximation spaces
associated with best $m$-term approximation in $L_p(\mathbb{R}^d)$ were completely characterized by
two of the present authors in [15].

We address Problem 1 in Sections 5 and 6, where it is proved that $\sigma_m(f, X(\Psi))_p =
O(m^{-\alpha})$ if and only if $f$ belongs to (essentially) a Besov space. Put another way, the
wavelet bi-frame system completely characterizes Besov spaces through the quantities
$\sigma_m(f, X(\Psi))_p$. This characterization holds for smoothness parameters $\alpha$ in a certain range
limited by the number of vanishing moments of the dual frame $X(\tilde\Psi)$. For a univariate
orthonormal wavelet system, smoothness and decay automatically imply a sufficient
number of vanishing moments, see e.g. [16, Theorem 3.4], but this is no longer true for wavelet
bi-frame systems. We use the standard approach to handle the characterization problem,
deriving so-called Bernstein and Jackson estimates for the system $X(\Psi)$.

A nice corollary of the estimates in Sections 5 and 6 is that smooth functions (in the
Besov sense) have very sparse canonical expansions (1.1) in the bi-frame system. This
addresses Problem 2 and, again, this is true for functions with smoothness $\alpha$ in a certain
range given by the number of vanishing moments of the analysis system $X(\tilde\Psi)$.

The set of approximation spaces we are able to characterize in Section 6 is limited by
the number of vanishing moments of the functions in the dual frame $\tilde\Psi$. In fact, we show
in Section 3 that this limit cannot be improved in the general case. In Section 7, we
consider approximation with an oversampled version of the wavelet bi-frame dictionary
$X(\Psi)$ in dimension $d = 1$. For such oversampled systems we prove that there is a Jackson
inequality independent of the number of vanishing moments of the functions in $\Psi$. This
leads to a complete characterization of the approximation spaces for oversampled wavelet
bi-frame systems.

The structure of the paper is as follows. In Section 2 we review the most common
method to construct wavelet bi-frames using the so-called extension principles. Section 3
is devoted to proving boundedness for certain matrix operators that will be needed to
obtain Jackson inequalities for nonlinear approximation with wavelet bi-frame systems.

In Section 4 we give some elements of (nonlinear) approximation theory: we define
approximation spaces for a general dictionary and state the corresponding Jackson and
Bernstein inequalities. In Section 5 we prove a general Bernstein inequality for wavelet
bi-frame systems with compact support.

The matrix lemmas from Section 3 are used in Section 6 where we consider Jackson
inequalities for best $m$-term approximation with bi-frame systems in $L_p(\mathbb{R}^d)$, and we
discuss the fairly general case where a complete characterization of the approximation
spaces -associated with best $m$-term approximation in $L_p(\mathbb{R}^d)$ with bi-frame systems- in
terms of (essentially) Besov spaces is possible.

In the final section of the paper, Section 7, we prove a characterization of the
approximation spaces for oversampled wavelet bi-frame systems, again in terms of (essentially)
Besov spaces.

Appendix A contains the analysis of the stability properties of wavelet bi-frame
expansions in $L_p(\mathbb{R}^d)$. We give a complete characterization of the $L_p(\mathbb{R}^d)$-norm, $1 < p < \infty$,
in terms of analysis coefficients associated with the frame, and prove that the bi-frame
expansion gives an atomic decomposition for $L_p(\mathbb{R}^d)$. The characterization has the same
form as the classical characterization of the $L_p(\mathbb{R}^d)$-norm by wavelet coefficients, see e.g.
[20]. The main use of the results in this appendix is as a tool to derive a Jackson inequality
for wavelet bi-frame systems.

2. MRA BASED WAVELET BI-FRAME SYSTEMS 

In this section we will briefly describe how to construct MRA-based wavelet bi-frames 
-called bi-framelets- through so-called extension principles. The extension principles to 
construct bi-frames were introduced independently in [5] and [8]. We refer the reader to 
either [5] or [8] for a more detailed discussion of MRA based bi-frames. Below we use the 
notation of [8]. 

Let $\tau = (\tau_0, \tau_1, \ldots, \tau_L)$ and $\tilde\tau = (\tilde\tau_0, \tilde\tau_1, \ldots, \tilde\tau_L)$ be two sequences of $2\pi\mathbb{Z}^d$-periodic
essentially bounded functions. Assume that $\tau_0$ and $\tilde\tau_0$ both generate refinable functions

$$\hat\phi(2\xi) = \tau_0(\xi)\hat\phi(\xi) \quad\text{and}\quad \hat{\tilde\phi}(2\xi) = \tilde\tau_0(\xi)\hat{\tilde\phi}(\xi),$$

satisfying

$$\lim_{\xi\to 0}\hat\phi(\xi) = 1 \quad\text{and}\quad \lim_{\xi\to 0}\hat{\tilde\phi}(\xi) = 1,$$

with

$$\operatorname*{ess\,sup}_{\xi} \sum_{k\in\mathbb{Z}^d} |\hat\phi(\xi - k)|^2 < \infty \quad\text{and}\quad \operatorname*{ess\,sup}_{\xi} \sum_{k\in\mathbb{Z}^d} |\hat{\tilde\phi}(\xi - k)|^2 < \infty,$$

where $\hat\phi(\xi)$ is the Fourier transform of the function $\phi(x)$. We associate the wavelets to $\tau$
and $\tilde\tau$ as follows:

$$\hat\psi^\ell(2\xi) = \tau_\ell(\xi)\hat\phi(\xi), \qquad \hat{\tilde\psi}^\ell(2\xi) = \tilde\tau_\ell(\xi)\hat{\tilde\phi}(\xi), \qquad \ell = 1, \ldots, L.$$

The spectrum $\sigma(\phi)$ associated to $\phi$ is defined up to a null-set as

$$\sigma(\phi) := \{\omega \in [-\pi, \pi]^d : \hat\phi(\omega + 2\pi k) \neq 0 \text{ for some } k \in \mathbb{Z}^d\}.$$

The spectrum $\sigma(\tilde\phi)$ associated to $\tilde\phi$ is defined likewise. Assuming that the systems $X(\Psi)$
and $X(\tilde\Psi)$ are both Bessel, we define the mixed fundamental function of the parent
vectors $\tau$ and $\tilde\tau$ by

$$\Theta(\xi) := \sum_{j=0}^{\infty} \sum_{\ell=1}^{L} \tau_\ell(2^j\xi)\,\overline{\tilde\tau_\ell(2^j\xi)} \prod_{m=0}^{j-1} \tau_0(2^m\xi)\,\overline{\tilde\tau_0(2^m\xi)}.$$

The following theorem, proved in [8], is the main tool to create bi-framelet systems; it
is called the Mixed Oblique Extension Principle.

Theorem 2.1 (Mixed OEP). Let $\tau$ and $\tilde\tau$ be the combined masks of the systems $X(\Psi)$
and $X(\tilde\Psi)$, respectively. Assume that the systems $X(\Psi)$ and $X(\tilde\Psi)$ are Bessel systems.
Suppose there exists a $2\pi$-periodic function $\Theta$ satisfying

a) $\Theta$ is essentially bounded, continuous at the origin, and $\Theta(0) = 1$.

b) If $\xi \in \sigma(\phi) \cap \sigma(\tilde\phi)$ and $\nu \in \{0, \pi\}^d$ such that $\xi + \nu \in \sigma(\phi) \cap \sigma(\tilde\phi)$, then

$$\Theta(2\xi)\,\tau_0(\xi)\,\overline{\tilde\tau_0(\xi+\nu)} + \sum_{\ell=1}^{L} \tau_\ell(\xi)\,\overline{\tilde\tau_\ell(\xi+\nu)} = \begin{cases} \Theta(\xi), & \text{if } \nu = 0, \\ 0, & \text{otherwise.} \end{cases}$$

Then $X(\Psi)$, $X(\tilde\Psi)$ is a bi-framelet system.

Remark 2.2. In many (most) interesting cases the spectra $\sigma(\phi)$ and $\sigma(\tilde\phi)$ are both equal
to $[-\pi, \pi]^d$. For example, if the integer translates of the scaling functions $\phi$ and $\tilde\phi$ are
Riesz sequences, this is the case.

When $X(\Psi) = X(\tilde\Psi)$, Theorem 2.1 gives the so-called Oblique Extension Principle, see
[8]. If, in addition, $\Theta = 1$, Theorem 2.1 reduces to the Unitary Extension Principle, see
[23, 24].

The reader can consult [5] and [8] for many explicit examples on how to construct
framelet systems using the different extension principles.
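
To make the unitary case concrete, here is a small numerical sketch (an illustration, not a construction taken from this paper). It assumes the well-known piecewise-linear B-spline tight framelet with three-tap filters, and checks the identity in condition b) with $\Theta = 1$ and $\tilde\tau = \tau$ on a frequency grid in dimension $d = 1$. The helper names `masks`, `uep_residuals` and `vanishing_moments` are hypothetical. The order of the zero of a wavelet mask at $\xi = 0$, read off from the discrete moments of its taps, gives the number of vanishing moments of the corresponding wavelet.

```python
import numpy as np

# Filter taps h[l][k], k = 0, 1, 2; mask values are
# tau_l(xi) = sum_k h[l][k] * exp(-i k xi).
h = np.array([
    [0.25, 0.5, 0.25],                        # tau_0: refinement mask
    [np.sqrt(2) / 4, 0.0, -np.sqrt(2) / 4],   # tau_1: first wavelet mask
    [-0.25, 0.5, -0.25],                      # tau_2: second wavelet mask
])

def masks(xi):
    """Evaluate all masks at the frequencies xi; result has shape (3, len(xi))."""
    k = np.arange(h.shape[1])
    return h @ np.exp(-1j * np.outer(k, xi))

def uep_residuals(xi):
    """Largest deviations from the nu = 0 and nu = pi identities (Theta = 1)."""
    t, t_shift = masks(xi), masks(xi + np.pi)
    diag = np.sum(t * np.conj(t), axis=0) - 1.0     # should vanish (nu = 0)
    cross = np.sum(t * np.conj(t_shift), axis=0)    # should vanish (nu = pi)
    return np.max(np.abs(diag)), np.max(np.abs(cross))

def vanishing_moments(taps, max_order=5):
    """Number of leading discrete moments sum_k k^n taps[k] that vanish."""
    k = np.arange(len(taps))
    for n in range(max_order):
        if abs(np.sum(k**n * taps)) > 1e-12:
            return n
    return max_order

xi = np.linspace(-np.pi, np.pi, 257)
print(uep_residuals(xi))
print([vanishing_moments(h[l]) for l in (1, 2)])
```

In this example the two wavelets have different numbers of vanishing moments, which is the kind of "few vanishing moments" situation the paper is concerned with.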

3. Some matrix lemmas 

This section contains mainly technical matrix lemmas that will be used to derive
Jackson inequalities for wavelet bi-frame systems in Section 6. Given two functions $\psi$ and $\eta$
with a prescribed number of vanishing moments and degree of smoothness, we study the
range of $\tau \in (0, \infty)$ for which the infinite matrix $[\langle \eta^p_{j,k}, \psi^{p'}_{j',k'}\rangle]_{j,j'\in\mathbb{Z},\,k,k'\in\mathbb{Z}^d}$ defines a bounded
operator on $\ell_\tau$, where $p \in (1, \infty)$, $1 = 1/p + 1/p'$ and

$$\psi^p_{j,k} := 2^{jd/p}\psi(2^j \cdot - k). \tag{3.1}$$

Notice that $\|\psi^p_{j,k}\|_{L_p(\mathbb{R}^d)} = \|\psi\|_{L_p(\mathbb{R}^d)}$ for all $j$ and $k$. We are especially interested in the
case where $\psi$ is a function from a bi-frame system with some (possibly few) vanishing
moments and $\eta$ is a “nice” wavelet.
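
The scale invariance of the $L_p$ norm under the normalization $2^{jd/p}\psi(2^j\cdot - k)$ can be checked numerically. The following sketch assumes $d = 1$ and uses the Haar wavelet purely as a convenient test function; `haar` and `lp_norm_of_atom` are hypothetical helper names, and the integral is approximated by a midpoint Riemann sum.

```python
import numpy as np

def haar(x):
    """Haar wavelet: +1 on [0, 1/2), -1 on [1/2, 1), 0 elsewhere."""
    return np.where((0 <= x) & (x < 0.5), 1.0,
                    np.where((0.5 <= x) & (x < 1.0), -1.0, 0.0))

def lp_norm_of_atom(j, k, p, n=2**16):
    """Midpoint-rule L_p norm of 2^(j/p) * haar(2^j x - k) in dimension one."""
    # The atom is supported on [k 2^-j, (k+1) 2^-j]; sample that interval only.
    a, b = k * 2.0**(-j), (k + 1) * 2.0**(-j)
    x = a + (b - a) * (np.arange(n) + 0.5) / n
    atom = 2.0**(j / p) * haar(2.0**j * x - k)
    return (np.sum(np.abs(atom)**p) * (b - a) / n) ** (1.0 / p)

for j, k, p in [(0, 0, 1.5), (3, 5, 2.0), (-2, 1, 4.0)]:
    print(j, k, p, lp_norm_of_atom(j, k, p))
```

Each printed norm should agree with the $L_p$ norm of the Haar wavelet itself, independently of the scale $j$ and the shift $k$.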

In order to prove the main result of the section (Proposition 3.5) we need the following 
technical results. 


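Lemma 3.1. Given $p \in [1, \infty)$, $\gamma > d$ and positive constants $c_1$, $c_2$, let

$$b_m = b_m(c_1, c_2) := \begin{cases} 2^{-mdc_1} & \text{for } m \ge 0, \\ 2^{mdc_2} & \text{for } m < 0. \end{cases}$$

Denoting by $|x|$ the Euclidean norm of $x \in \mathbb{R}^d$, for $k, k' \in \mathbb{Z}^d$ let

$$a_{m;k,k'} = a_{m;k,k'}(\gamma) := \begin{cases} 2^{-md/p}\,(1 + |k - 2^{-m}k'|)^{-\gamma} & \text{for } m \ge 0, \\ 2^{md/p'}\,(1 + |2^{m}k - k'|)^{-\gamma} & \text{for } m < 0, \end{cases}$$

where $1/p' = 1 - 1/p$. Suppose $p(1 - c_1) < \tau < p(1 + c_2)$ and $\tau \ge 1$. Then

$$\sum_{m\in\mathbb{Z}} \sum_{j'\in\mathbb{Z}} \sum_{k'\in\mathbb{Z}^d} b_m \Big( \sum_{k\in\mathbb{Z}^d} a_{m;k,k'} |d_{j'-m,k}| \Big)^{\tau} \le C\, \|\{d_{j,k}\}\|^{\tau}_{\ell_\tau}.$$

Proof. Lemma 8.10 in [21] implies for any $\{d_k\}_k \in \ell_\tau$, $1 \le \tau < \infty$,

$$\sum_{k'\in\mathbb{Z}^d} \Big( \sum_{k\in\mathbb{Z}^d} a_{m;k,k'} |d_k| \Big)^{\tau} \le C\, 2^{md(1 - \tau/p)} \sum_{k\in\mathbb{Z}^d} |d_k|^{\tau}, \qquad \text{for } m \in \mathbb{Z}.$$

The use of Lemma 8.10 in [21] is direct for $m < 0$ and a duality argument is used to get
the estimate for $m \ge 0$. This estimate yields

$$\sum_{m\in\mathbb{Z}} \sum_{j'\in\mathbb{Z}} \sum_{k'\in\mathbb{Z}^d} b_m \Big( \sum_{k\in\mathbb{Z}^d} a_{m;k,k'} |d_{j'-m,k}| \Big)^{\tau}
\le C \sum_{m\in\mathbb{Z}} b_m\, 2^{md(1-\tau/p)} \sum_{j'\in\mathbb{Z}} \sum_{k\in\mathbb{Z}^d} |d_{j'-m,k}|^{\tau}
= C \Big( \sum_{m\in\mathbb{Z}} b_m(\delta_1, \delta_2) \Big) \|\{d_{j,k}\}\|^{\tau}_{\ell_\tau},$$

where $\delta_1 := c_1 - (1 - \tau/p)$ and $\delta_2 := c_2 + (1 - \tau/p)$. Now, the lemma follows since
$(1 - c_1) < \tau/p < (1 + c_2)$ implies $\delta_1 > 0$ and $\delta_2 > 0$. □
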

With this lemma at hand we can now characterize a range of $\tau$ for which the matrix
$[b_{j'-j}\,a_{j'-j;k,k'}]_{j,j'\in\mathbb{Z},\,k,k'\in\mathbb{Z}^d}$ defines a bounded operator on $\ell_\tau$.

Lemma 3.2. Given $p \in [1, \infty)$, $\gamma > d$ and positive constants $c_1$, $c_2$, let $b_m$ and $a_{m;k,k'}$ be
as in Lemma 3.1. Then

$$\sum_{j'\in\mathbb{Z},\,k'\in\mathbb{Z}^d} \Big( \sum_{j\in\mathbb{Z},\,k\in\mathbb{Z}^d} b_{j'-j}\, a_{j'-j;k,k'}\, |c_{j,k}| \Big)^{\tau} \le C\, \|\{c_{j,k}\}\|^{\tau}_{\ell_\tau}$$

for any $\tau$ in the range

$$A(c_1) < \tau < p(1 + c_2),$$

where

$$A(x) := A(x, p, d/\gamma) := \begin{cases} p(1 - x) & \text{for } x < 1 - 1/p, \\ (x + 1/p)^{-1} & \text{for } 1 - 1/p \le x < \gamma/d - 1/p, \\ d/\gamma & \text{for } \gamma/d - 1/p \le x. \end{cases} \tag{3.2}$$


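Proof. For $\tau \ge 1$, Hölder's inequality (for the sum over $j$), with $1 = 1/\tau + 1/\tau'$, yields

$$\sum_{j\in\mathbb{Z},\,k\in\mathbb{Z}^d} b_{j'-j}\, a_{j'-j;k,k'} |c_{j,k}|
= \sum_{j\in\mathbb{Z}} b_{j'-j}^{1/\tau' + 1/\tau} \Big( \sum_{k\in\mathbb{Z}^d} a_{j'-j;k,k'} |c_{j,k}| \Big)
\le \Big( \sum_{j\in\mathbb{Z}} b_{j'-j} \Big)^{1/\tau'} \Big( \sum_{j\in\mathbb{Z}} b_{j'-j} \Big( \sum_{k\in\mathbb{Z}^d} a_{j'-j;k,k'} |c_{j,k}| \Big)^{\tau} \Big)^{1/\tau}
\le C \Big( \sum_{m\in\mathbb{Z}} b_m \Big( \sum_{k\in\mathbb{Z}^d} a_{m;k,k'} |c_{j'-m,k}| \Big)^{\tau} \Big)^{1/\tau}.$$

The result then follows using Lemma 3.1, provided that $\max(1, p(1 - c_1)) < \tau < p(1 + c_2)$.
For $\tau < 1$ we have

$$\sum_{j'\in\mathbb{Z},\,k'\in\mathbb{Z}^d} \Big( \sum_{j\in\mathbb{Z},\,k\in\mathbb{Z}^d} b_{j'-j}\, a_{j'-j;k,k'} |c_{j,k}| \Big)^{\tau}
\le \sum_{j'\in\mathbb{Z},\,k'\in\mathbb{Z}^d} \sum_{j\in\mathbb{Z},\,k\in\mathbb{Z}^d} b^{\tau}_{j'-j}\, a^{\tau}_{j'-j;k,k'} |c_{j,k}|^{\tau}
= \sum_{m\in\mathbb{Z}} \sum_{j'\in\mathbb{Z}} \sum_{k'\in\mathbb{Z}^d} b^{\tau}_m \Big( \sum_{k\in\mathbb{Z}^d} a^{\tau}_{m;k,k'} |c_{j'-m,k}|^{\tau} \Big)
= \sum_{m\in\mathbb{Z}} \sum_{j'\in\mathbb{Z}} \sum_{k'\in\mathbb{Z}^d} \tilde b_m \Big( \sum_{k\in\mathbb{Z}^d} \tilde a_{m;k,k'} |c_{j'-m,k}|^{\tau} \Big).$$

In the last line we have used the fact that

$$b^{\tau}_m\, a^{\tau}_{m;k,k'} = b_m(c_1, c_2)^{\tau}\, a_{m;k,k'}(\gamma)^{\tau} = b_m(\tilde c_1, \tilde c_2)\, a_{m;k,k'}(\tilde\gamma) =: \tilde b_m\, \tilde a_{m;k,k'}$$

with $\tilde c_1 = c_1\tau - (1/p)(1 - \tau)$, $\tilde c_2 = c_2\tau - (1/p')(1 - \tau)$ and $\tilde\gamma = \gamma\tau$. Notice that we have the
same structure as in Lemma 3.1. Thus, using Lemma 3.1 with $\{d_{j,k}\} = \{|c_{j,k}|^{\tau}\} \in \ell_1$ we
obtain that the result holds provided that $p(1 - \tilde c_1) < 1 < p(1 + \tilde c_2)$ and $\tilde\gamma > d$, which is
equivalent to

$$\tau > \max\big(d/\gamma,\ (c_1 + 1/p)^{-1}\big). \tag{3.3}$$

Now combining the results for $\tau < 1$ and $\tau \ge 1$ we see that if $c_1 + 1/p \le 1$, the set of
$\tau < 1$ that satisfy (3.3) is empty, and as $\max(1, p(1 - c_1)) = p(1 - c_1)$ we get the first part
of the result. If $c_1 + 1/p > 1$, we get the rest. □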


Definition 3.3. For $N \in \mathbb{N}$ and $\gamma > 0$ we let $D^N_\gamma(\mathbb{R}^d)$ be the set of all functions $f$ defined
on $\mathbb{R}^d$ with $N$ derivatives and decay $\gamma$, i.e., for which there exists a constant $C < \infty$ such
that

$$|\partial^\alpha f(x)| \le C(1 + |x|)^{-\gamma} \quad \text{for } x \in \mathbb{R}^d,\ \alpha \in \mathbb{N}^d,\ |\alpha| \le N, \tag{3.4}$$

where $|\alpha|$ is the usual length of a multi-index. Likewise, we let $M^N_\gamma(\mathbb{R}^d)$ denote the set of
all functions $f$ with $N$ vanishing moments and decay, i.e., for which

$$\int x^\alpha f(x)\,dx = 0 \quad \text{for } \alpha \in \mathbb{N}^d,\ |\alpha| < N,$$

and

$$|f(x)| \le C(1 + |x|)^{-d-N-\gamma} \quad \text{for } x \in \mathbb{R}^d. \tag{3.5}$$

We need the following lemma in order to prove Proposition 3.5. 

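Lemma 3.4. Consider $N_1, N_2 \in \mathbb{N}$ and $\gamma_1, \gamma_2 > 0$. Suppose $\eta \in D^{N_1}_{\gamma_1}(\mathbb{R}^d) \cap M^{N_2}_{\gamma_2}(\mathbb{R}^d)$ and
$\psi \in D^{N_2}_{\gamma_2}(\mathbb{R}^d) \cap M^{N_1}_{\gamma_1}(\mathbb{R}^d)$. Then, for $j' \ge j$, we have

$$|(\eta_{j,k} \star \psi_{j',k'})(x)| \le \frac{C\, 2^{-(j'-j)(N_1 + d/2)}}{(1 + |2^{j}x - 2^{j-j'}k' - k|)^{\gamma_1}}, \tag{3.6}$$

and for $j' \le j$

$$|(\eta_{j,k} \star \psi_{j',k'})(x)| \le \frac{C\, 2^{-(j-j')(N_2 + d/2)}}{(1 + |2^{j'}x - k' - 2^{j'-j}k|)^{\gamma_2}}, \tag{3.7}$$

where $\star$ denotes the convolution product.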


The result is well known (see e.g. Lemma 3.3 in [13]) but will be shown here for the 
sake of completeness. 

Proof. We will simply prove Eq. (3.6), since interchanging the roles of $\psi$ and $\eta$ will then
provide Eq. (3.7). We notice that $(\eta_{j,k} \star \psi_{j',k'})(x) = (\eta \star \psi_{j'-j,0})(2^{j}x - 2^{j-j'}k' - k)$. Thus,
it suffices to prove Eq. (3.6) for $j = k = k' = 0$ and $m := j' - j \ge 0$. Since $\psi$ has $N_1$
vanishing moments, a Taylor expansion of $\eta$ gives

$$|\eta \star \psi_{m,0}(x)| \le \Big( \int_{|y-x| \le |x|/2} + \int_{|y-x| > |x|/2} \Big) |\psi_{m,0}(x - y)|\, |x - y|^{N_1} E(x, y)\,dy = A + B,$$

where $E(x, y) := \sup_{|\beta| = N_1} \sup_{\varepsilon\in(0,1)} |\partial^\beta \eta(x + \varepsilon(y - x))|/\beta!$ (with $\beta!$ the standard factorial
for a multi-index $\beta \in \mathbb{N}^d$). By (3.4) we have $|E(x, y)| \le c(1 + |x|)^{-\gamma_1}$ when $|y - x| \le |x|/2$.
This estimate together with (3.5) applied to $\psi$ yields

$$A \le C \int_{|y-x| \le |x|/2} |\psi_{m,0}(x - y)|\, |x - y|^{N_1}\,dy \cdot (1 + |x|)^{-\gamma_1}
\le C\, 2^{md/2} \int_{|y| \le |x|/2} (1 + 2^{m}|y|)^{-(N_1 + d + \gamma_1)} |y|^{N_1}\,dy \cdot (1 + |x|)^{-\gamma_1}
\le C\, 2^{md/2}\, 2^{-m(N_1 + d)} (1 + |x|)^{-\gamma_1}.$$

Moreover, since $E$ is bounded, we have

$$B \le C \int_{|y-x| > |x|/2} |\psi_{m,0}(x - y)|\, |x - y|^{N_1}\,dy
\le C\, 2^{md/2} \int_{|y-x| > |x|/2} (1 + 2^{m}|x - y|)^{-(N_1 + d + \gamma_1)} |x - y|^{N_1}\,dy
\le C\, 2^{md/2}\, 2^{-m(N_1 + d)} (1 + |x|)^{-\gamma_1}.$$

This proves (3.6). □


We can now state the main result of this section. 
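Proposition 3.5. Suppose $\eta \in D^{N_1}_\gamma(\mathbb{R}^d) \cap M^{N_2}_\gamma(\mathbb{R}^d)$ and $\psi \in D^{N_2}_\gamma(\mathbb{R}^d) \cap M^{N_1}_\gamma(\mathbb{R}^d)$ for
some $N_1, N_2 \in \mathbb{N}$ and $\gamma > d$. Consider the matrix operator $T$ given by

$$(T\{c_{j,k}\})_{j',k'} := \sum_{j\in\mathbb{Z},\,k\in\mathbb{Z}^d} c_{j,k}\, \langle \eta^p_{j,k}, \psi^{p'}_{j',k'}\rangle \tag{3.8}$$

for $p \in [1, \infty)$ and $1 = 1/p + 1/p'$. Then $T$ is bounded on $\ell_\tau(\mathbb{Z} \times \mathbb{Z}^d)$ for any $\tau$ in the
range

$$A\Big(\frac{N_1}{d}\Big) < \tau < p\Big(1 + \frac{N_2}{d}\Big), \tag{3.9}$$

with $A(x) = A(x, p, d/\gamma)$ given by (3.2).

Proof. By Lemma 3.4,

$$|\langle \eta_{j,k}, \psi_{j',k'}\rangle| \le \begin{cases} \dfrac{C\, 2^{(j-j')(d/2 + N_1)}}{(1 + |k - 2^{j-j'}k'|)^{\gamma}} & \text{for } j' \ge j, \\[2ex] \dfrac{C\, 2^{(j'-j)(d/2 + N_2)}}{(1 + |k' - 2^{j'-j}k|)^{\gamma}} & \text{for } j' < j. \end{cases}$$

Recalling the definition of $\psi^{p'}_{j',k'}$ and $\eta^p_{j,k}$ in Eq. (3.1), this gives for $j' \ge j$,

$$|\langle \eta^p_{j,k}, \psi^{p'}_{j',k'}\rangle|
\le C\, 2^{jd(1/p - 1/2)}\, 2^{j'd(1/p' - 1/2)}\, \frac{2^{(j-j')(d/2 + N_1)}}{(1 + |k - 2^{j-j'}k'|)^{\gamma}}
= C\, \frac{2^{j(d/p - d/2 + d/2)}\, 2^{-j'(-d/p' + d/2 + d/2)}\, 2^{(j-j')N_1}}{(1 + |k - 2^{j-j'}k'|)^{\gamma}}
= C\, \frac{2^{(j-j')(d/p + N_1)}}{(1 + |k - 2^{j-j'}k'|)^{\gamma}}.$$

A similar argument shows that for $j' < j$

$$|\langle \eta^p_{j,k}, \psi^{p'}_{j',k'}\rangle| \le C\, \frac{2^{(j'-j)(d/p' + N_2)}}{(1 + |k' - 2^{j'-j}k|)^{\gamma}}.$$

We can thus write

$$|\langle \eta^p_{j,k}, \psi^{p'}_{j',k'}\rangle| \le C\, b_{j'-j}(N_1/d, N_2/d)\, a_{j'-j;k,k'}(\gamma). \tag{3.10}$$

The result now follows using Lemma 3.2.

□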
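
For concrete parameter values, the admissible range (3.9) is easy to tabulate. The sketch below is only an illustration: it transcribes the three-branch lower-bound function of Eq. (3.2) and the interval (3.9) under the standing assumption $\gamma > d$, and the names `lower_bound` and `admissible_range` are hypothetical.

```python
def lower_bound(x, p, d_over_gamma):
    """The piecewise function of Eq. (3.2), evaluated at x (assumes gamma > d)."""
    gamma_over_d = 1.0 / d_over_gamma
    if x < 1.0 - 1.0 / p:
        return p * (1.0 - x)
    if x < gamma_over_d - 1.0 / p:
        return 1.0 / (x + 1.0 / p)
    return d_over_gamma

def admissible_range(n1, n2, d, p, gamma):
    """Open interval of exponents given by Eq. (3.9), as (lower, upper)."""
    return lower_bound(n1 / d, p, d / gamma), p * (1.0 + n2 / d)

# Example: d = 1, p = 2, decay gamma = 4, with N_1 = 1 and N_2 = 2.
print(admissible_range(1, 2, 1, 2.0, 4.0))
```

The three branches agree at the breakpoints, so the lower bound is a continuous, non-increasing function of $x$: more vanishing moments $N_1$ never shrink the range.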


It is the lower bound in (3.9), $A(N_1/d) < \tau$, that will be most important to us in the
subsequent sections. The following example shows that this bound cannot be improved
in the general case.

Proposition 3.6. Suppose $\psi \in L_2(\mathbb{R})$ has fast decay and exactly $N_1 \in \mathbb{N}$ vanishing
moments, i.e., $\psi \in M^{N_1}_\gamma(\mathbb{R})$ but $\psi \notin M^{N_1+1}_\gamma(\mathbb{R})$ for any $\gamma > 1$, and suppose $\eta \in C^\infty(\mathbb{R})$
is compactly supported. Consider the matrix operator $T$ given by (3.8) in Proposition 3.5
for some $p \in [1, \infty)$. Then $T$ is not bounded on $\ell_\tau(\mathbb{Z} \times \mathbb{Z})$ for any $\tau < (N_1 + 1/p)^{-1}$.
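Proof. It suffices to prove that $\sum_{j,k} |\langle \eta, \psi^{p'}_{j,k}\rangle|^{\tau} = \infty$ for $\tau < 1/(N_1 + 1/p)$. Since $\psi$ has
exactly $N_1$ vanishing moments and fast decay, there exists a function $\theta$ with fast decay,
such that $\psi = (-1)^{N_1}\theta^{(N_1)}$ and $\int_{\mathbb{R}} \theta(x)\,dx \neq 0$, see e.g. [19, Theorem 6.2]. Observe that

$$\langle \eta, \psi_{j,k}\rangle = \int_{\mathbb{R}} \eta(x)\, 2^{j/2} \psi(2^{j}x - k)\,dx
= 2^{-jN_1} (-1)^{N_1} \int_{\mathbb{R}} \eta(x)\, 2^{j/2}\, \frac{d^{N_1}}{dx^{N_1}}\big[\theta(2^{j}x - k)\big]\,dx
= 2^{-jN_1} \int_{\mathbb{R}} \eta^{(N_1)}(x)\, 2^{j/2}\, \theta(2^{j}x - k)\,dx
= 2^{-j(N_1 + 1/2)} \big(\eta^{(N_1)} * w_{2^{-j}}\big)(2^{-j}k),$$

where $w_\varepsilon(x) := \varepsilon^{-1}\theta(-x/\varepsilon)$. Since $\eta \in C^\infty(\mathbb{R})$ has compact support,
$\lim_{\varepsilon\to 0} (\eta^{(N_1)} * w_\varepsilon)(x) = a \cdot \eta^{(N_1)}(x)$ uniformly, where $a = \int_{\mathbb{R}} \theta(x)\,dx$, see e.g. [12]. Moreover, if we define
the Riemann sums $s_\varepsilon := \varepsilon \sum_{k\in\mathbb{Z}} |\eta^{(N_1)}(\varepsilon k)|^{\tau}$, $\tau \in (0, \infty)$, we have $\lim_{\varepsilon\to 0} s_\varepsilon = \|\eta^{(N_1)}\|^{\tau}_{\tau}$.
Combining these two properties, there exists $\varepsilon_0 > 0$ such that

$$\varepsilon \sum_{k\in\mathbb{Z}} |(\eta^{(N_1)} * w_\varepsilon)(\varepsilon k)|^{\tau} \ge c\, \|\eta^{(N_1)}\|^{\tau}_{\tau} \qquad \text{for all } \varepsilon \in (0, \varepsilon_0).$$

Thus, since $\psi^{p'}_{j,k} = 2^{j(1/p' - 1/2)}\psi_{j,k}$, there exists $j_0 > 0$ such that for all $j \ge j_0$,

$$\sum_{k\in\mathbb{Z}} |\langle \eta, \psi^{p'}_{j,k}\rangle|^{\tau} = 2^{-j(N_1 + 1 - 1/p')\tau} \sum_{k\in\mathbb{Z}} |(\eta^{(N_1)} * w_{2^{-j}})(2^{-j}k)|^{\tau}
\ge c\, 2^{-j(N_1 + 1/p)\tau}\, 2^{j}\, \|\eta^{(N_1)}\|^{\tau}_{\tau}.$$

Now, in order to get $\sum_{j,k} |\langle \eta, \psi^{p'}_{j,k}\rangle|^{\tau} < \infty$ we need to have $-(N_1 + 1/p)\tau + 1 < 0$, that is
to say $\tau > 1/(N_1 + 1/p)$. □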


4. Approximation spaces 


The remaining sections contain the core of the paper. We begin by introducing some
notions of approximation theory that will be used throughout the rest of the paper. In
Section 5 we prove a Bernstein inequality that provides “one half” of the characterization
of $m$-term approximation with “nice” wavelet bi-frame systems. In Section 6 we will
consider Jackson inequalities that match the Bernstein inequalities in order to get complete
characterizations.

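A dictionary $\mathcal{D} = \{g_k\}_{k\in\mathbb{N}}$ in $L_p(\mathbb{R}^d)$ is a countable collection of quasi-normalized
elements from $L_p(\mathbb{R}^d)$. For $\mathcal{D}$ we consider the collection of all possible $m$-term expansions
with elements from $\mathcal{D}$:

$$\Sigma_m(\mathcal{D}) := \Big\{ \sum_{i\in\Lambda} c_i g_i \;\Big|\; c_i \in \mathbb{C},\ \operatorname{card}\Lambda \le m \Big\}.$$

The error of the best $m$-term approximation to an element $f \in L_p(\mathbb{R}^d)$ is then

$$\sigma_m(f, \mathcal{D})_p := \inf_{f_m \in \Sigma_m(\mathcal{D})} \|f - f_m\|_{L_p(\mathbb{R}^d)}.$$
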

© 2004 Elsevier. This is the author version of an article published in an Elsevier journal. The original publication 
is available at www.sciencedirect.com with DOI: 10.1016/j.acha.2004.01.004 




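Definition 4.1 (Approximation spaces). The approximation space $\mathcal{A}^\alpha_q(L_p(\mathbb{R}^d), \mathcal{D})$ is
defined by

$$|f|_{\mathcal{A}^\alpha_q(L_p(\mathbb{R}^d), \mathcal{D})} := \Big( \sum_{m=1}^{\infty} \big( m^{\alpha} \sigma_m(f, \mathcal{D})_p \big)^{q}\, \frac{1}{m} \Big)^{1/q} < \infty,$$

and (quasi)normed by $\|f\|_{\mathcal{A}^\alpha_q(L_p(\mathbb{R}^d), \mathcal{D})} = \|f\|_p + |f|_{\mathcal{A}^\alpha_q(L_p(\mathbb{R}^d), \mathcal{D})}$, for $0 < q, \alpha < \infty$. When
$q = \infty$, the $\ell_q$ norm is replaced by the sup-norm.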
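
In the simplest special case, far narrower than the setting of this paper, the quantities above can be computed exactly: for $p = 2$ and an orthonormal dictionary, the best $m$-term error is the $\ell_2$ norm of the coefficients outside the $m$ largest ones. The sketch below assumes exactly that situation, together with the usual form $\big(\sum_m (m^\alpha \sigma_m)^q / m\big)^{1/q}$ of the seminorm, truncated to finitely many terms; the function names are hypothetical.

```python
import numpy as np

def best_m_term_errors(coeffs):
    """errors[m] = best m-term error, m = 0..len(coeffs) (orthonormal basis, p = 2)."""
    mags = np.sort(np.abs(np.asarray(coeffs)))[::-1]       # decreasing rearrangement
    tail_energy = np.cumsum((mags**2)[::-1])[::-1]         # energy of entries m, m+1, ...
    return np.sqrt(np.append(tail_energy, 0.0))

def approximation_seminorm(coeffs, alpha, q):
    """Finite-sum version of the approximation-space seminorm (m = 1..len(coeffs))."""
    errors = best_m_term_errors(coeffs)
    m = np.arange(1, len(errors))
    weighted = m**alpha * errors[1:]
    if np.isinf(q):
        return np.max(weighted)                            # sup-norm for q = infinity
    return np.sum(weighted**q / m) ** (1.0 / q)

print(best_m_term_errors([3.0, -4.0, 0.0]))
print(approximation_seminorm([3.0, -4.0, 0.0], alpha=1.0, q=1.0))
```

For redundant systems and $p \neq 2$ no such closed form is available, which is why the paper works with Jackson and Bernstein inequalities instead.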


In what follows, we use the notation $V \hookrightarrow W$ to indicate that $V$ is continuously embedded
in $W$ for two (quasi)normed spaces $V$ and $W$, i.e., $V \subset W$ and there is a constant $C < \infty$
such that $\|\cdot\|_W \le C \|\cdot\|_V$.

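It is well known that the main tool in the characterization of $\mathcal{A}^\alpha_q(L_p(\mathbb{R}^d), \mathcal{D})$ comes from
the link between approximation theory and interpolation theory (see e.g. [11, Theorem
9.1, Chapter 7]). Let $Y_p(\mathbb{R}^d)$ be a (quasi)Banach space continuously embedded in $L_p(\mathbb{R}^d)$
with semi-(quasi)norm $|\cdot|_{Y_p}$. Given $\alpha > 0$, the Jackson inequality

$$\sigma_m(f, \mathcal{D})_p \le C m^{-\alpha} |f|_{Y_p(\mathbb{R}^d)}, \qquad \forall f \in Y_p(\mathbb{R}^d),\ \forall m \in \mathbb{N} \tag{4.1}$$

and the Bernstein inequality

$$|S|_{Y_p(\mathbb{R}^d)} \le C' m^{\alpha} \|S\|_p, \qquad \forall S \in \Sigma_m(\mathcal{D}) \tag{4.2}$$

(with some constants $C$ and $C'$ independent of $f$, $S$ and $m$) imply, respectively, the
continuous embedding

$$\big(L_p(\mathbb{R}^d), Y_p(\mathbb{R}^d)\big)_{\beta/\alpha, q} \hookrightarrow \mathcal{A}^\beta_q(L_p(\mathbb{R}^d), \mathcal{D})$$

and the converse embedding

$$\big(L_p(\mathbb{R}^d), Y_p(\mathbb{R}^d)\big)_{\beta/\alpha, q} \hookleftarrow \mathcal{A}^\beta_q(L_p(\mathbb{R}^d), \mathcal{D})$$

for all $0 < \beta < \alpha$ and $q \in (0, \infty]$. Here $(X, Y)_{\theta, q}$ denotes the real interpolation space
between the Banach spaces $X$ and $Y$. We refer the reader to [1] for the definition of the
real interpolation method.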

Thus, to get a complete characterization of the approximation space $\mathcal{A}^\alpha_q(L_p(\mathbb{R}^d), X(\Psi))$,
we need to prove associated Jackson and Bernstein inequalities. In Section 5, a Bernstein
inequality for an MRA based bi-frame system is proved. In Section 6, a Jackson inequality
is proved for a general bi-frame system under mild assumptions on the smoothness, decay
and number of vanishing moments of its generators. The proof relies on the matrix lemmas
proved in Section 3 and provides, at the same time, a characterization of Besov spaces in
terms of the $\ell_\tau$ norm of the dual (properly re-normalized) frame coefficients that appear
in Eq. (1.1). The set of approximation spaces we are able to characterize in Section 6
is limited by the number of vanishing moments of the functions in the dual frame $\tilde\Psi$.
In Section 7, Jackson and Bernstein inequalities for a twice oversampled MRA based
bi-frame system are considered. For such a system the Jackson inequalities are, in fact,
independent of the number of vanishing moments of the functions in $\Psi$. This leads to a
complete characterization of the approximation spaces for oversampled framelet systems
without any assumptions on the number of vanishing moments.
5. Bernstein estimates for the bi-frame system

Let us begin by proving a Bernstein inequality for bi-framelet systems, since such an
inequality will show us “how big” $\mathcal{A}^\alpha_q(L_p(\mathbb{R}^d), X(\Psi))$ can possibly be. In the following
we denote by $W^s(L_\infty(\mathbb{R}^d))$ the Sobolev space consisting of functions with $s$ distributional
derivatives in $L_\infty(\mathbb{R}^d)$. Given a function $\phi \in L_\infty(\mathbb{R}^d)$, let

$$\Gamma = \{k \in \mathbb{Z}^d : |\{x \in (0, 1)^d : \phi(x - k) \neq 0\}| > 0\}.$$

We say that $\{\phi(\cdot - k)\}_{k\in\mathbb{Z}^d}$ is a locally linearly independent set if the set $\{\phi(\cdot - k)\}_{k\in\Gamma}$
is linearly independent. For MRA based bi-frame systems we have the following Bernstein
inequality.

Proposition 5.1. Let $X(\Psi)$, $X(\tilde\Psi)$ be a bi-framelet system and assume that $X(\Psi)$ is
based on a compactly supported refinable function $\phi$ where:

(1) $\phi \in W^s(L_\infty(\mathbb{R}^d))$ with $s > 0$;

(2) $\{\phi(\cdot - k)\}_{k\in\mathbb{Z}^d}$ is a locally linearly independent set (this condition is void if $d = 1$);

(3) The functions $\tau_\ell(\xi)$, $1 \le \ell \le L$, are trigonometric polynomials (see Section 2).

Then the Bernstein inequality

$$|S|_{B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))} \le C m^{\alpha} \|S\|_{L_p(\mathbb{R}^d)}, \qquad \forall S \in \Sigma_m(X(\Psi)),\ \forall m \ge 1 \tag{5.1}$$

holds true for each $0 < \alpha < s/d$, $0 < p < \infty$, with $1/\tau := \alpha + 1/p$ and $C = C(\alpha, p)$.

Proof. In the case $d = 1$, if the integer shifts of the function $\phi$ are not already linearly
independent, we can always find a perfect generator $\bar\phi$ for the shift invariant space $S_0 :=
\operatorname{span}\{\phi(\cdot - k) : k \in \mathbb{Z}\}$, i.e., $\bar\phi$ is a compactly supported refinable function with linearly
independent shifts that generates $S_0$, see [18, Theorem 1]. In particular, there exists a
finite sequence $\{a_k\}_k$ such that $\phi(x) = \sum_k a_k \bar\phi(x - k)$. In the arguments below we may
use $\bar\phi$ in place of $\phi$.

By the result of Jia [17], for each $0 < \alpha < s/d$, the Bernstein inequality

$$|S|_{B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))} \le C m^{\alpha} \|S\|_{L_p(\mathbb{R}^d)}, \qquad \forall S \in \Sigma_m(X(\phi)),$$

$1/\tau := \alpha + 1/p$, $0 < p < \infty$, holds true for the system

$$X(\phi) := \{\phi(2^{j}x - k)\}_{j\in\mathbb{Z},\,k\in\mathbb{Z}^d}.$$

Now, since $X(\Psi)$ is based on $\phi$ we have finite masks $\{b^\ell_k\}_k$ such that

$$\psi^\ell(x) = \sum_{k\in\mathbb{Z}^d} b^\ell_k\, \phi(2x - k).$$

Thus, for $j \in \mathbb{Z}$ and $i \in \mathbb{Z}^d$, we have

$$\psi^\ell(2^{j}x - i) = \sum_{k\in\mathbb{Z}^d} b^\ell_k\, \phi(2^{j+1}x - 2i - k). \tag{5.2}$$

That is to say, $\psi^\ell_{j,i} \in \Sigma_K(X(\phi))$ for some uniform constant $K$ depending only on the length
of the finite masks used above. Take any $S \in \Sigma_m(X(\Psi))$, then $S \in \Sigma_{Km}(X(\phi))$. Using
the Bernstein inequality for $X(\phi)$ we obtain the wanted inequality,

$$|S|_{B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))} \le C (Km)^{\alpha} \|S\|_{L_p(\mathbb{R}^d)} \le C' m^{\alpha} \|S\|_{L_p(\mathbb{R}^d)}, \qquad \forall S \in \Sigma_m(X(\Psi)).$$
□
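
The counting step in the proof, that an $m$-term combination of wavelets is a combination of at most $Km$ dilated and shifted copies of the refinable function, can be mimicked with a few lines of bookkeeping. This is only a sketch in dimension one with made-up finite masks (the values in `MASKS` are hypothetical and are not tied to any particular framelet); a wavelet atom is indexed by `(l, j, i)` and a scaling atom by `(scale, shift)`, following the index arithmetic of Eq. (5.2).

```python
from collections import defaultdict

# Hypothetical finite masks: psi^l(x) = sum_k MASKS[l][k] * phi(2x - k).
MASKS = {
    1: {0: 0.5, 1: -0.5},
    2: {0: 0.25, 1: -0.5, 2: 0.25},
}

def expand_in_scaling_atoms(expansion, masks=MASKS):
    """Rewrite sum c * psi^l(2^j x - i) as a sum of atoms phi(2^(j+1) x - n).

    `expansion` maps (l, j, i) to a coefficient; the result maps
    (j + 1, 2 i + k) to the accumulated coefficient, as in Eq. (5.2).
    """
    out = defaultdict(float)
    for (l, j, i), c in expansion.items():
        for k, b in masks[l].items():
            out[(j + 1, 2 * i + k)] += c * b
    return dict(out)

S = {(1, 0, 0): 1.0, (2, 0, 0): 2.0, (2, 3, -1): 1.0}   # a 3-term expansion
atoms = expand_in_scaling_atoms(S)
K = max(len(b) for b in MASKS.values())
print(len(atoms), "<=", K * len(S))
```

Atoms produced by different wavelets may coincide, so the number of scaling atoms can be strictly smaller than $Km$; the proof only needs the upper bound.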

Proposition 5.1 shows that, at best, the approximation space $\mathcal{A}^\alpha_q(L_p(\mathbb{R}^d), X(\Psi))$ will be
(essentially) a Besov space. Indeed, as mentioned in Section 4, the Bernstein inequality
(5.1) implies the continuous embedding

$$\mathcal{A}^\beta_q(L_p(\mathbb{R}^d), X(\Psi)) \hookrightarrow \big(L_p(\mathbb{R}^d), B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))\big)_{\beta/\alpha, q}, \tag{5.3}$$

for $0 < \beta < \alpha$ and $q \in (0, \infty]$. It is well known (see, e.g., [9]) that the right hand side of
(5.3) equals the Besov space $B^{d\beta}_q(L_q(\mathbb{R}^d))$ when $1/q = \beta + 1/p$.

Thus, in order to be able to completely characterize $\mathcal{A}^\alpha_q(L_p(\mathbb{R}^d), X(\Psi))$, we need to
obtain a matching Jackson estimate for smooth functions in $B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))$. We address
this problem in the following section.


6. Jackson estimates for the bi-frame system

In this section, we derive a Jackson inequality that matches the Bernstein inequality 
obtained in the previous section, and we obtain a complete characterization of m-term 
approximation with “nice” wavelet bi-frame systems. The basic tool is the matrix lemmas 
given in Section 3. The main result of this section is the following theorem. 

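Theorem 6.1. Let $X(\Psi)$, $X(\tilde\Psi)$ be a wavelet bi-frame system and assume that $X(\Psi)$ is
based on a compactly supported refinable function $\phi$ where:

(1) $\phi \in W^s(L_\infty(\mathbb{R}^d))$ with $s > 0$;

(2) $\{\phi(\cdot - k)\}_{k\in\mathbb{Z}^d}$ is a locally linearly independent set (this condition is void if $d = 1$);

(3) The functions $\tau_\ell(\xi)$ are trigonometric polynomials (see Section 2);

(4) $\tilde\Psi \subset C^\beta(\mathbb{R}^d) \cap M^{N_1}_\gamma(\mathbb{R}^d)$ for some $\beta > 0$, $N_1 \in \mathbb{N}$ and $\gamma > d$.

Let $p \in (1, \infty)$ and $\tau := (\alpha + 1/p)^{-1}$ where we assume

$$0 < \alpha < \min\Big\{ \frac{s}{d},\ \frac{1}{A(N_1/d)} - \frac{1}{p} \Big\} \tag{6.1}$$

with $A(x) = A(x, p, d/\gamma)$ given by (3.2). Then, for each $0 < \beta < \alpha$, $q \in (0, \infty]$, we have
the characterization

$$\mathcal{A}^\beta_q(L_p(\mathbb{R}^d), X(\Psi)) = \big(L_p(\mathbb{R}^d), B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))\big)_{\beta/\alpha, q}.$$

We refer the reader to Definition 3.3 for the class $M^{N_1}_\gamma(\mathbb{R}^d)$. The above theorem is
obtained simply by combining Proposition 5.1 above with Proposition 6.2 below.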


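Proposition 6.2. Let $X(\Psi)$, $X(\tilde\Psi)$ be a bi-frame. Suppose $\tilde\Psi \subset M^{N_1}_\gamma(\mathbb{R}^d)$ for some
$N_1 \in \mathbb{N}$ and $\gamma > d$ and suppose there exist $\beta, \varepsilon > 0$ such that for all $\psi \in \Psi \cup \tilde\Psi$,
$\psi \in C^\beta(\mathbb{R}^d)$ and $|\psi(x)| \le C(1 + |x|)^{-d-\varepsilon}$. Then, we have the Jackson inequality

$$\sigma_m(f, X(\Psi))_p \le C m^{-\alpha} \|f\|_{B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))}$$

for $p \in (1, \infty)$, $A(N_1/d) < \tau < p$, and $\alpha = 1/\tau - 1/p$, with $A(x) = A(x, p, d/\gamma)$ given by
(3.2).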
We will provide a detailed proof of Proposition 6.2 later in this section, but let us first
give an outline of the proof. First, using the characterization of Besov spaces in terms
of wavelet coefficients, we know that the wavelet coefficients of any $f \in B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))$ are
in $\ell_\tau$. Then, using the matrix lemmas from Section 3, we show that the wavelet-frame
coefficients in Eq. (1.1), once properly normalized, are also in $\ell_\tau$ for $\tau$ admissible. Thanks
to the $L_p(\mathbb{R}^d)$-stability of bi-frame expansions -which we prove in Appendix A- we show
that this implies $f \in \mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X(\Psi))$, where the latter class is a “sparseness class”
of functions in $L_p(\mathbb{R}^d)$ with well-defined $\ell_\tau$-summable frame expansions in $X(\Psi)$. We
can then conclude and get the desired Jackson inequality by relying on a more general
“abstract” Jackson inequality with $Y_p = \mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X(\Psi))$ proved in [14, Theorem 6].

Let us now give the definition of the sparseness classes $\mathcal{K}_{\tau,q}(L_p(\mathbb{R}^d), X(\Psi))$. Then, we
will have all the tools in our hands to prove Proposition 6.2.


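6.1. Sparseness Classes. It will be proved in the appendix that the bi-frame system
$X(\Psi)$, once normalized in $L_p(\mathbb{R}^d)$, has the so-called $\ell_{p,1}$-hilbertian property (see
Corollary A.4). Applying Proposition 3 in [14], it follows that we can define, for $p \in (1, \infty)$,
$\tau < p$ and $q \in (0, \infty]$:

$$\mathcal{K}_{\tau,q}(L_p(\mathbb{R}^d), X(\Psi)) := \Big\{ f \in L_p(\mathbb{R}^d) \;\Big|\; \exists\, \{c^\ell_{j,k}\} \in \ell_{\tau,q},\ f = \sum_{\ell=1}^{L} \sum_{j\in\mathbb{Z},\,k\in\mathbb{Z}^d} c^\ell_{j,k}\, \psi^{\ell,p}_{j,k} \Big\}$$

and $|f|_{\mathcal{K}_{\tau,q}(L_p(\mathbb{R}^d), X(\Psi))}$ the smallest Lorentz norm $\|\{c^\ell_{j,k}\}_{j,k,\ell}\|_{\ell_{\tau,q}}$ such that
$f = \sum_{\ell} \sum_{j,k} c^\ell_{j,k}\, \psi^{\ell,p}_{j,k}$ (see, e.g., (A.3) for the definition of the Lorentz norm on $\ell_{p,q}$).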




6.2. Proof of a Jackson inequality for the bi-frame system. We are now ready to 
give the proof of Proposition 6.2. 

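Proof of Proposition 6.2. Take $f \in B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))$. The expansion of $f$ in the Meyer wavelet
system $\{\eta^i\}_{i=1}^{2^d-1}$ is given by

$$f = \sum_{i=1}^{2^d-1} \sum_{j\in\mathbb{Z},\,k\in\mathbb{Z}^d} c^i_{j,k}\, \eta^{i,p}_{j,k}, \qquad 1/\tau - 1/p = \alpha,$$

with coefficients $c^i_{j,k}$ satisfying $\|\{c^i_{j,k}\}\|_{\ell_\tau} \asymp \|f\|_{B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))}$, see [10]. Now we calculate the
bi-frame coefficients of $f$:

$$\langle f, \tilde\psi^{\ell,p'}_{j',k'}\rangle = \sum_{i=1}^{2^d-1} \sum_{j\in\mathbb{Z},\,k\in\mathbb{Z}^d} c^i_{j,k}\, \langle \eta^{i,p}_{j,k}, \tilde\psi^{\ell,p'}_{j',k'}\rangle. \tag{6.2}$$

Since $\eta^i \in D^N_\gamma(\mathbb{R}^d) \cap M^N_\gamma(\mathbb{R}^d)$ for any $N$ and $\gamma$, we can apply Proposition 3.5 for each $\ell$ and
$i$ to get that $\|\{\langle f, \tilde\psi^{\ell,p'}_{j,k}\rangle\}\|_{\ell_\tau} \le C \|f\|_{B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))}$. According to Corollary A.4, the system
$X(\Psi)$ is $\ell_{p,1}$-hilbertian so the canonical frame expansion (1.1), which can be rewritten as

$$f = \sum_{\ell=1}^{L} \sum_{j\in\mathbb{Z},\,k\in\mathbb{Z}^d} \langle f, \tilde\psi^{\ell,p'}_{j,k}\rangle\, \psi^{\ell,p}_{j,k}, \tag{6.3}$$

is unconditionally convergent in $L_p(\mathbb{R}^d)$. It follows that $f \in \mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X(\Psi))$ and
$|f|_{\mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X(\Psi))} \le C \|f\|_{B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))}$. Eventually, using again the fact that the system
$X(\Psi)$ is $\ell_{p,1}$-hilbertian, we obtain by [14, Theorem 6]

$$\sigma_m(f, X(\Psi))_p \le C m^{-\alpha} |f|_{\mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X(\Psi))}
\le C m^{-\alpha} \|\{\langle f, \tilde\psi^{\ell,p'}_{j,k}\rangle\}\|_{\ell_\tau}
\le C m^{-\alpha} \|f\|_{B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))}. \tag{6.4}$$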

□ 

Remark 6.3. Notice that the sparse representation of $f \in B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))$ we use to prove
that $f \in \mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X(\Psi))$ is exactly the canonical bi-frame expansion (properly
normalized in $L_p(\mathbb{R}^d)$). Thus, to realize the rate of approximation given by the Jackson
estimate, we simply threshold the coefficients of the expansion (6.3).
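
The effect of thresholding can be illustrated in a toy setting. The sketch below is an assumption-laden stand-in for the situation of the remark: it works with an orthonormal basis and $p = 2$ (so the error is the $\ell_2$ norm of the discarded coefficients) and with synthetic coefficients $c_n = n^{-1/\tau}$, and it estimates the decay exponent of the error against the number of retained terms. The function names are hypothetical.

```python
import numpy as np

def hard_threshold(coeffs, eps):
    """Keep the coefficients with magnitude > eps and set the others to zero."""
    coeffs = np.asarray(coeffs, dtype=float)
    return np.where(np.abs(coeffs) > eps, coeffs, 0.0)

def threshold_rate(coeffs, thresholds):
    """Least-squares slope of log(error) against log(number of kept terms)."""
    coeffs = np.asarray(coeffs, dtype=float)
    counts, errors = [], []
    for eps in thresholds:
        kept = hard_threshold(coeffs, eps)
        counts.append(np.count_nonzero(kept))
        errors.append(np.linalg.norm(coeffs - kept))   # l_2 error (orthonormal case)
    slope, _ = np.polyfit(np.log(counts), np.log(errors), 1)
    return slope

# Synthetic coefficients c_n = n^(-1/tau) with tau = 1.
tau = 1.0
c = np.arange(1, 200001, dtype=float) ** (-1.0 / tau)
rate = threshold_rate(c, thresholds=2.0 ** -np.arange(4, 11))
print(rate)  # expected to be close to -(1/tau - 1/2) under these assumptions
```

Under these assumptions the fitted slope mirrors the relation $\alpha = 1/\tau - 1/p$ with $p = 2$; the paper's contribution is to establish the corresponding statement for redundant bi-frame expansions in $L_p$.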

6.3. Characterization of the sparseness classes. One of the interesting byproducts
of the proof of Proposition 6.2 is that, under the hypotheses of Theorem 6.1, we also get
a characterization of Besov spaces and approximation spaces in terms of the sparseness
classes. Indeed, one can deduce from the proof of Proposition 6.2 that

$$B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d)) = \big\{ f \in L_p(\mathbb{R}^d) : \|\{\langle f, \tilde\psi^{\ell,p'}_{j,k}\rangle\}\|_{\ell_\tau} < \infty \big\} = \mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X(\Psi)) \tag{6.5}$$

for $\alpha = 1/\tau - 1/p$ and $\tau$ admissible (see the statement of Proposition 6.2). The embedding
of $B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))$ into the middle class in Eq. (6.5) is derived explicitly in the proof of
Proposition 6.2. The embedding of the middle class into $\mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X(\Psi))$ is trivial,
and the converse embedding, $\mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X(\Psi)) \hookrightarrow B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))$, is a consequence of
the Jackson estimate $\mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X(\Psi)) \hookrightarrow \mathcal{A}^\alpha_\tau(L_p(\mathbb{R}^d), X(\Psi))$ -which follows from [14,
Theorem 6]- and the Bernstein “embedding” (5.3) for the system $X(\Psi)$.

7. On the role of Vanishing Moments 

In this section, we discuss the role of the number $N_1$ of vanishing moments of the
bi-framelet system with respect to its ability to characterize Besov spaces through sparseness
spaces and/or approximation spaces. We say that a collection of wavelets $\Psi$ has vanishing
moments of order $N$ if each $\psi \in \Psi$ has at least $N$ vanishing moments (i.e., $\psi \in M^N_\gamma(\mathbb{R}^d)$
for some $\gamma > 0$, see Eq. (3.5)), and at least one of the functions in $\Psi$ has exactly $N$
vanishing moments, that is, $\psi \notin M^{N'}_{\gamma}(\mathbb{R}^d)$ for any $N' > N$. We denote by VM($\Psi$) the order of
vanishing moments of $\Psi$.

At the beginning of this section, we discuss the fact that when VM($\tilde\Psi$) is “small”, the
range of smoothness parameters $\alpha$ for which Besov spaces are characterized by bi-framelet
expansions/approximations (see Theorem 6.1 and Eq. (6.5)) can be rather limited. Following
an idea introduced in [15] we propose means to overcome this limitation: since
the system $X(\Psi)$ is redundant, one can choose another representation of any $f$ than its
canonical frame expansion; by replacing the canonical frame representation with a “better”
one, it is sometimes possible to recover the characterization of Besov spaces even
when VM($\tilde\Psi$) is small. However, the price we have to pay for getting better expansions is
that we can no longer expand the functions in the canonical system $X(\Psi)$, but we have
to consider expansions in the twice oversampled system, which will be defined soon.

7.1. Limitations of the canonical frame expansion. Let us first motivate the need
for the results of this section by discussing the role of the number $N_1 := \mathrm{VM}(\tilde\Psi)$ of
vanishing moments of the dual bi-frame, and some limitations of Theorem 6.1 when we
use framelet systems with few vanishing moments.

As stated in Theorem 6.1 the number of vanishing moments puts a limit to the range
of approximation spaces for the bi-framelets that can be characterized even if we assume
the wavelets are smooth and compactly supported. Notice that by Eq. (3.2),

1 1 (~ 1 ) _1 ' 1/Ap if x - 1 - 
- 7 -t -~=\x if 1 - 1 /p < x < 7 /d - 1 /p, 

’ b I 7 /d— 1/p if 7 /d — 1/p < x. 


Thus, if $N_1/d > 1 - 1/p$, the set of admissible $\alpha$ in Theorem 6.1 is given by

(7.1)    $0 < \alpha < \min\{\, s/d,\; N_1/d,\; \gamma/d - 1/p \,\}$.

If $N_1/d \le 1 - 1/p$, the set is even more restricted. This shows the importance of the three
qualities of a wavelet: smoothness, vanishing moments and decay. A “good” bi-frame
system $X(\Psi)$, $X(\tilde\Psi)$ in this context is one where the functions $\psi$ are compactly supported and
smooth, and the functions $\tilde\psi$ have fast decay and a large number of vanishing moments.

In Figure 1 we display, as a function of $N_1/d$ (and with $s$, $d$, $p$ and $\gamma$ fixed), the range
of admissible values of $\alpha$.


Figure 1. A crosshatch of the range of admissible values of $\alpha$ in Theorem
6.1 (given by (6.1)) as a function of $N_1/d$; $\alpha_{\max} = \min\{s/d,\ \gamma/d - 1/p\}$.


VM($\psi$) for univariate wavelets. For a univariate orthonormal wavelet $\psi \in L_2(\mathbb{R}) \cap C^r(\mathbb{R})$
with decay $|\psi(x)| \le C(1 + |x|)^{-r-\varepsilon}$ for some $\varepsilon > 0$, it is well known that $\psi$ has (at least) $r$
vanishing moments, see e.g. [16, Theorem 2.3.4]. So for orthonormal wavelets, smoothness
and decay enforce vanishing moments, and we get the characterization

(7.2)    $\mathcal{A}^\beta_q(L_p(\mathbb{R}), X(\psi)) = (L_p(\mathbb{R}), B^\alpha_\tau(L_\tau(\mathbb{R})))_{\beta/\alpha,\,q}$

for all $1 < p < \infty$, $0 < q \le \infty$, $0 < \beta < \alpha < r$, and $\tau = (\alpha + 1/p)^{-1}$.

VM($\Psi$) for univariate tight spline framelets. The relation between smoothness and
vanishing moments satisfied by orthonormal wavelets is far from true for wavelet frames. The
most well-known example is the family of spline-based tight wavelet frames built through
the unitary extension principle: the smoothness $s$, which corresponds to the degree of
the splines, can be arbitrarily high; however at least one of the framelets will have only
one vanishing moment, see [8], hence $N_1 = \mathrm{VM}(\Psi) = \mathrm{VM}(\tilde\Psi) = 1$. In this case, since
the wavelets have compact support, we can take $\gamma$ large compared to $N_1$ and check that the range of
admissible $\alpha$ is exactly $0 < \alpha < N_1 = 1$ while the smoothness $s$ of $\Psi$ might be arbitrarily
large.

Wavelet bi-frames with few vanishing moments in $L_p(\mathbb{R}^d)$. Suppose the decay $\gamma$ and
smoothness $s$ are large compared to the number $N_1$ of vanishing moments of $X(\tilde\Psi)$:

         $s > N_1$,
         $\gamma > d + N_1$,

and that the latter is not too small, i.e., $N_1 > d$. Then, again, the range of admissible $\alpha$
is exactly $0 < \alpha < N_1/d$.
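
The ranges quoted in the two examples above can be reproduced with a few lines of arithmetic. The sketch below assumes that the admissible range (7.1) reads $0 < \alpha < \min\{s/d,\ N_1/d,\ \gamma/d - 1/p\}$ when $N_1/d > 1 - 1/p$, which is our reading of (7.1) together with the caption of Figure 1; the function name `admissible_alpha_sup` is hypothetical.

```python
def admissible_alpha_sup(s, n1, gamma, d, p):
    """Supremum of the admissible alpha, assuming (7.1) is
    0 < alpha < min(s/d, N1/d, gamma/d - 1/p), valid when N1/d > 1 - 1/p."""
    if not (1.0 < p < float("inf")):
        raise ValueError("expected 1 < p < infinity")
    if n1 / d <= 1.0 - 1.0 / p:
        raise ValueError("this formula assumes N1/d > 1 - 1/p")
    return min(s / d, n1 / d, gamma / d - 1.0 / p)

# Univariate tight spline framelets: N1 = 1, s and gamma large.
print(admissible_alpha_sup(s=5, n1=1, gamma=10, d=1, p=2))
# Bivariate case with s > N1, gamma > d + N1 and N1 > d.
print(admissible_alpha_sup(s=4, n1=3, gamma=6, d=2, p=2))
```

In both calls the binding constraint is $N_1/d$, in agreement with the discussion above: increasing $s$ or $\gamma$ further does not enlarge the range.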

At first sight, it seems that the possibly limited number of vanishing moments of
bi-framelet systems is an obstacle that prevents them from having good approximation
properties, in the sense that the characterization (7.2) is for a limited range of $\alpha$. The
rest of this section is devoted to showing that this restriction can be avoided by using an
oversampled version of the framelet system.

7.2. Beyond the canonical frame decomposition. We have mentioned that to get
characterizations of Besov spaces with systems with few vanishing moments, we will need
to oversample them. Given a wavelet bi-frame $X(\Psi)$, $X(\tilde\Psi)$ and $R \ge 1$ we let $X_R(\Psi)$
denote the oversampled system,

         $X_R(\Psi) := \{ 2^{jd/2}\psi^\ell(2^j \cdot - k/R) \mid j \in \mathbb{Z},\ k \in \mathbb{Z}^d,\ \ell = 1, 2, \ldots, L \}$.

Just as the non-oversampled system, the oversampled one $X_R(\Psi)$ is a frame in $L_2(\mathbb{R}^d)$
and is $\ell_{p,1}$-hilbertian in $L_p(\mathbb{R}^d)$ after proper normalization. See, e.g., [15, Theorem 4.11]
for a proof in the case $R = 2$, which directly extends to arbitrary oversampling factors
$R \in \mathbb{N}$. Denoting $\psi^\ell_I(\cdot) = 2^{jd/2}\psi^\ell(2^j \cdot - k/R)$, $I \in D_R$, where $D_R$ is the collection of
all “oversampled dyadic intervals” $I = 2^{-j}([0,1]^d + k/R)$, we can thus define sparseness
classes for $p \in (1, \infty)$, $\tau < p$ and $q \in (0, \infty]$ as follows

         $\mathcal{K}_{\tau,q}(L_p(\mathbb{R}^d), X_R(\Psi)) := \Big\{ f \in L_p(\mathbb{R}^d) : \exists \{c^\ell_I\} \in \ell_{\tau,q},\ f = \sum_{I \in D_R,\ \ell \in E} c^\ell_I \psi^{\ell,p}_I \Big\}$

and $|f|_{\mathcal{K}_{\tau,q}(L_p(\mathbb{R}^d), X_R(\Psi))}$ the smallest Lorentz norm $\|\{c^\ell_I\}\|_{\ell_{\tau,q}}$ such that $f = \sum_{I,\ell} c^\ell_I \psi^{\ell,p}_I$.

It turns out that, when $X(\Psi)$, $X(\tilde\Psi)$ is a “nice” wavelet bi-frame, the oversampled
system $X_R(\Psi)$, again, gives rise to an approximation space no larger than a Besov space.
Indeed, using refinability in (5.2) we can prove a corresponding Bernstein inequality for
the oversampled system $X_R(\Psi)$, by exactly the same arguments as given in the proof of
Proposition 5.1 (see [4] for details on how to use refinability to deal with the oversampling).

Proposition 7.1. Let $X(\Psi)$, $X(\tilde\Psi)$ be a bi-framelet system with $X(\Psi)$ based on a
compactly supported refinable function $\phi$ where:

(1) $\phi \in W^s(L_\infty(\mathbb{R}^d))$ with $s > 0$;

(2) $\{\phi(\cdot - k)\}_{k \in \mathbb{Z}^d}$ is a locally linearly independent set (this condition is void if $d = 1$);

(3) The functions $\tau_\ell(\xi)$, $1 \le \ell \le L$, are trigonometric polynomials (see Section 2).

Then, the Bernstein inequality

         $|S|_{B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))} \le C m^{\alpha} \|S\|_{L_p(\mathbb{R}^d)}, \quad \forall S \in \Sigma_m(X_R(\Psi)),\ \forall m \ge 1$

holds true for each $0 < \alpha < s/d$, $0 < p < \infty$, with $1/\tau := \alpha + 1/p$ and $C = C(\alpha, p)$.

Again, in order to get a complete characterization of Besov spaces in terms of the
approximation spaces based on $X_R(\Psi)$ we need to prove a matching Jackson estimate.

Just as in Section 6, we will use the fact that we can find a “nice” orthogonal wavelet
system with a sparse expansion in $X_R(\Psi)$. However, the sparse expansion that we will
consider is no longer expressed in terms of canonical frame coefficients of the orthogonal
wavelet(s), but rather in terms of adequately chosen sparse synthesis coefficients, as can be seen
in the next lemma.

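Lemma 7.2. Let $X(\Psi)$, $X(\tilde\Psi)$ be a wavelet bi-frame system. Let $\psi \in L_2(\mathbb{R}^d)$ for which
there exists a sequence $\{d^\ell_k\}_{\ell \in E,\ k \in \mathbb{Z}^d}$ and $R \in \mathbb{N}$ such that

         $\psi(x) = \sum_{\ell=1}^{L} \sum_{k \in \mathbb{Z}^d} d^\ell_k\, \psi^\ell(x - k/R)$.

Then, for $1 < p < \infty$, and $\tau < p$ such that $\{d^\ell_k\} \in \ell_1 \cap \ell_\tau$, we have

         $\mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X(\psi)) \hookrightarrow \mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X_R(\Psi))$.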

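Proof. Let $f \in \mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X(\psi))$. Then $f = \sum_{j,k} c_{j,k}\psi^p_{j,k}$ for some sequence $\{c_{j,k}\} \in \ell_\tau$.
We rewrite this as

         $f = \sum_{j \in \mathbb{Z},\ k \in \mathbb{Z}^d} c_{j,k}\, 2^{jd/p}\, \psi(2^j \cdot - k)$

           $= \sum_{j \in \mathbb{Z},\ k \in \mathbb{Z}^d} c_{j,k}\, 2^{jd/p} \sum_{\ell=1}^{L} \sum_{k' \in \mathbb{Z}^d} d^\ell_{k'}\, \psi^\ell(2^j \cdot - k - k'/R)$

           $= \sum_{j \in \mathbb{Z},\ k \in \mathbb{Z}^d} \sum_{\ell=1}^{L} \sum_{n \in \mathbb{Z}^d} c_{j,k}\, d^\ell_{n - Rk}\, 2^{jd/p}\, \psi^\ell(2^j \cdot - n/R)$

           $= \sum_{j \in \mathbb{Z},\ n \in \mathbb{Z}^d} \sum_{\ell=1}^{L} \Big( \sum_{k} c_{j,k}\, d^\ell_{n - Rk} \Big) 2^{jd/p}\, \psi^\ell(2^j \cdot - n/R)$.

It is easy to check using brute force for $0 < \tau \le 1$, and Young’s inequality for $1 < \tau < p$,
that

         $\sum_{j \in \mathbb{Z},\ n \in \mathbb{Z}^d} \sum_{\ell=1}^{L} \Big| \sum_{k} c_{j,k}\, d^\ell_{n - Rk} \Big|^\tau \le L \cdot \max\big( \|\{d^\ell_k\}\|^\tau_{\ell_1}, \|\{d^\ell_k\}\|^\tau_{\ell_\tau} \big) \cdot \sum_{j \in \mathbb{Z},\ k \in \mathbb{Z}^d} |c_{j,k}|^\tau$,

and we conclude that indeed $f \in \mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X_R(\Psi))$. □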
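
The coefficient estimate at the heart of this proof can be checked numerically in a simplified setting. The sketch below assumes $d = 1$, a single scale $j$, a single generator ($L = 1$) and finitely supported real sequences; the helper names `resample_coeffs` and `lemma_bound_holds` are hypothetical. It forms $b_n = \sum_k c_k d_{n - Rk}$ by upsampling $\{c_k\}$ by the factor $R$ and convolving with $\{d_k\}$, then compares $\sum_n |b_n|^\tau$ with $\max(\|d\|^\tau_{\ell_1}, \|d\|^\tau_{\ell_\tau}) \sum_k |c_k|^\tau$.

```python
import numpy as np

def resample_coeffs(c, d, R):
    """Return b with b[n] = sum_k c[k] * d[n - R*k] (zero outside the supports)."""
    c = np.asarray(c, dtype=float)
    d = np.asarray(d, dtype=float)
    up = np.zeros(R * (len(c) - 1) + 1)
    up[::R] = c                      # place c[k] at position R*k
    return np.convolve(up, d)        # (up * d)[n] = sum_k c[k] d[n - R*k]

def lemma_bound_holds(c, d, R, tau):
    """Check sum|b_n|^tau <= max(||d||_1^tau, ||d||_tau^tau) * sum|c_k|^tau."""
    c = np.asarray(c, dtype=float)
    d = np.asarray(d, dtype=float)
    b = resample_coeffs(c, d, R)
    lhs = np.sum(np.abs(b) ** tau)
    d_l1 = np.sum(np.abs(d)) ** tau          # ||d||_{l_1}^tau
    d_ltau = np.sum(np.abs(d) ** tau)        # ||d||_{l_tau}^tau
    rhs = max(d_l1, d_ltau) * np.sum(np.abs(c) ** tau)
    return bool(lhs <= rhs * (1.0 + 1e-12))  # small slack for rounding

rng = np.random.default_rng(0)
c = rng.standard_normal(50)
d = rng.standard_normal(7)
print([lemma_bound_holds(c, d, R=2, tau=t) for t in (0.5, 1.0, 1.7)])
```

For $\tau \le 1$ the bound follows from $|\sum_k x_k|^\tau \le \sum_k |x_k|^\tau$, and for $\tau > 1$ from Young's inequality, exactly as indicated in the proof; the factor $L$ appears only when several generators are summed.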

The following corollary gives more details on how Lemma 7.2 will be used to prove the
desired Jackson inequality.

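Corollary 7.3. Let $X(\Psi)$, $X(\tilde\Psi)$ be a wavelet bi-frame system, $X(\{\eta^i\}_{i=1}^{2^d-1})$ a bi-orthogonal
wavelet basis and $r > 0$ such that

         $B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d)) = \mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X(\{\eta^i\}_{i=1}^{2^d-1})), \qquad 0 < \alpha = 1/\tau - 1/p < r$.

Assume that for $1 \le i \le 2^d - 1$ there exist sequences $\{d^{\ell,i}_k\}_{\ell \in E,\ k \in \mathbb{Z}^d} \in \ell_{1/(r+1)}$ such that

         $\eta^i(x) = \sum_{\ell=1}^{L} \sum_{k \in \mathbb{Z}^d} d^{\ell,i}_k\, \psi^\ell(x - k/R)$.

Then, for $1 < p < \infty$, and $0 < \alpha = 1/\tau - 1/p < r$, we have the Jackson inequality

         $\sigma_m(f, X_R(\Psi))_p \le C m^{-\alpha} \|f\|_{B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))}$.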

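Proof. First, notice that we have (see [14]), for $0 < \alpha = 1/\tau - 1/p < r$,

         $B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d)) = \mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X(\{\eta^i\}_{i=1}^{2^d-1})) = \sum_{i=1}^{2^d-1} \mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X(\eta^i))$.

Then, since $\tau = (\alpha + 1/p)^{-1} > (r + 1/p)^{-1} > (r + 1)^{-1}$ we have $\ell_{1/(r+1)} \subset \ell_\tau$. Hence, using
the fact that $\{d^{\ell,i}_k\} \in \ell_{1/(r+1)} \subset \ell_\tau \cap \ell_1$, we can apply Lemma 7.2 to get, for each $i$,

         $\mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X(\eta^i)) \hookrightarrow \mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}^d), X_R(\Psi))$.

The conclusion follows just as in the proof of Proposition 6.2. □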

We can now combine Corollary 7.3 and Proposition 7.1 to get the following complete
characterization of the approximation spaces $\mathcal{A}^\beta_q(L_p(\mathbb{R}^d), X_2(\Psi))$. Notice the extended
range for $\alpha$ as compared to Theorem 6.1.

Corollary 7.4. Suppose $X(\Psi)$, $X(\tilde\Psi)$ satisfy all the assumptions of Proposition 7.1 and
Corollary 7.3 with parameters $s$ and $r$, respectively. Then, for $0 < \alpha < \min(s/d, r)$,
$0 < \beta < \alpha$, $q \in (0, \infty]$, we have the characterization

(7.3)    $\mathcal{A}^\beta_q(L_p(\mathbb{R}^d), X_2(\Psi)) = (L_p(\mathbb{R}^d), B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d)))_{\beta/\alpha,\,q}$.

Now the task is to show that the assumption of Lemma 7.2 is satisfied for interesting
framelet systems. This will be done in the following subsection where we restrict to the
univariate case $d = 1$.


7.3. Constructing wavelets out of framelets in the univariate case. In our strategy
to get a Jackson inequality for the (oversampled) framelet system $X_R(\Psi)$, the crucial
issue is to identify some “nice” wavelet(s) that can be expanded sparsely in terms of
oversampled bi-framelets. In [4] such a construction is proposed in the multivariate case
for wavelet type systems that need not be frames; however, there is no control on how large
the oversampling factor $R$ must be, and the proof is not constructive. In the univariate
case for spline-based tight wavelet frames, it was shown in [15] how to get a finite expansion
of a nice semi-orthogonal wavelet in the twice oversampled ($R = 2$) framelet system. Here,
still in the univariate case, we propose a construction which is valid for more general
wavelet bi-frames and only requires $R = 2$. It is an interesting but open question whether
similar constructions are possible in the multivariate case.

The constructed wavelet will be the “standard” orthogonal wavelet associated with the
MRA underlying the bi-framelet system.

Definition 7.5. Let $\phi$ be a univariate scaling function generated by the refinement filter
$\tau_0(\xi)$, and let $P(\xi) := \sum_{k \in \mathbb{Z}} |\hat\phi(\xi - k)|^2$. The “standard” orthonormal wavelet $\psi$ associated
with the scaling function $\phi$ is defined by

(7-4) ip( 2£) = e“^ro(C + n) -j== . 

Let us recall that the number of vanishing moments of the standard orthonormal wavelet
associated to $\phi$ is given by

         $\mathrm{VM}(\psi) = \max\{ N : |\tau_0(\xi + \pi)| = O(|\xi|^N) \text{ around } \xi = 0 \}$,

see, e.g., [20]. Moreover, as discussed previously, if $\phi$ is $r$-regular (see [20]) then so is $\psi$
and we have $\mathrm{VM}(\psi) \ge r$.
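
For a trigonometric polynomial filter the order of vanishing of $\tau_0(\xi + \pi)$ at $\xi = 0$ can be read off from the filter coefficients. The sketch below assumes the filter is written as $\tau_0(\xi) = \sum_k h_k e^{-ik\xi}$ with finitely many real coefficients $h_k$ (a notational assumption, not taken from the paper), so that the order equals the multiplicity of the factor $(1 + z)$ in $H(z) = \sum_k h_k z^k$. The function name `zero_order_at_pi` and the tolerance are hypothetical choices.

```python
import numpy as np

def zero_order_at_pi(h, tol=1e-9):
    """Multiplicity of the root z = -1 of H(z) = sum_k h[k] z^k,
    i.e. the order of the zero of tau_0 at xi = pi."""
    p = np.asarray(h, dtype=float)
    order = 0
    while p.size > 1:
        # Divide by (z + 1); the multiplicity does not depend on whether the
        # coefficients are listed in increasing or decreasing degree.
        quotient, remainder = np.polydiv(p, np.array([1.0, 1.0]))
        if np.max(np.abs(remainder)) > tol:
            break
        p, order = quotient, order + 1
    return order

haar = [0.5, 0.5]
cubic_bspline = [1 / 8, 3 / 8, 3 / 8, 1 / 8]
s3 = np.sqrt(3.0)
daubechies4 = [(1 + s3) / 8, (3 + s3) / 8, (3 - s3) / 8, (1 - s3) / 8]
print(zero_order_at_pi(haar), zero_order_at_pi(cubic_bspline), zero_order_at_pi(daubechies4))
```

The three filters are normalized so that $\tau_0(0) = 1$; the B-spline filter $((1 + e^{-i\xi})/2)^3$ gives order three, while the four-tap Daubechies filter gives order two.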

Proposition 7.6. Let $X(\Psi)$, $X(\tilde\Psi)$ be an MRA-based wavelet bi-frame system with
combined mask $\tau = (\tau_0, \tau_1, \ldots, \tau_L)$ and $\tilde\tau$, and let $\phi$ and $\psi$ be respectively the scaling function
generated by $\tau_0$ and the associated standard orthonormal wavelet. Suppose that

• each filter $\tau_\ell$, $0 \le \ell \le L$, is a trigonometric polynomial;

• $\sum_{\ell=1}^{L} |\tau_\ell(\xi)|^2 > 0$ for $\xi \ne 0$;

• $\phi$ is an $r$-regular scaling function (not necessarily orthonormal);

• $\mathrm{VM}(\Psi) \le \mathrm{VM}(\psi)$.

Then $\psi$ can be expressed as a linear combination

         $\psi(\cdot) := \sum_{\ell=1}^{L} \sum_{k \in \mathbb{Z}} d^\ell_k\, \psi^\ell(\cdot - k/2), \qquad \Psi = \{\psi^1, \psi^2, \ldots, \psi^L\}$,

where $\{d^\ell_k\} \in \bigcap_{\tau > 0} \ell_\tau$.

Remark 7.7. It is clear that such a construction will fail if we have $\mathrm{VM}(\psi) < \mathrm{VM}(\Psi)$. This
follows from the fact that a sparse expansion of functions with at least $\mathrm{VM}(\Psi)$ vanishing
moments will also have (at least) $\mathrm{VM}(\Psi)$ vanishing moments, while the orthonormal
wavelet will have exactly $\mathrm{VM}(\psi) < \mathrm{VM}(\Psi)$ vanishing moments. However this is not a
problem since, in such a case, $X(\Psi)$ has enough vanishing moments to get a Jackson
inequality from the theory developed in Section 6.

Proof. We want to expand the standard orthonormal wavelet $\psi$ in the twice oversampled
framelet system. In the frequency domain the problem is to find “nice” $2\pi$-periodic
functions $Q_\ell(\xi)$ such that

         $\hat\psi(\xi) = \sum_{\ell=1}^{L} Q_\ell(\xi/2)\, \tau_\ell(\xi/2)\, \hat\phi(\xi/2)$.

We will look for $Q_\ell$ of the form $Q_\ell(\xi) = Q(\xi)\overline{\tau_\ell(\xi)}$. Using Eq. (7.4), we see that the
problem will be solved if $Q_\ell$ has fast decaying Fourier coefficients and $Q$ satisfies

£=1 V 
Hence, we define for $\xi \ne 0$


(7.5) 


n ,c, T t(() ■ « * 

' eLwoi 2 vm ' 


Let us check that $Q_\ell$ can be readily extended at $\xi = 0$ and that the resulting extension has
no pole on the unit circle. First, we have$^1$, for $\xi$ close to zero, $\sum_{\ell=1}^{L} |\tau_\ell(\xi)|^2 \asymp |\xi|^{2\cdot\mathrm{VM}(\Psi)}$.
Then, we use the fact that, for $\xi$ close to zero,

         $|\tau_\ell(\xi)\tau_0(\xi + \pi)| = O(|\xi|^{\mathrm{VM}(\Psi) + \mathrm{VM}(\psi)}) = O(|\xi|^{2\cdot\mathrm{VM}(\Psi)})$.

We conclude by proving that the Fourier coefficients of $Q_\ell$ decay faster than the inverse of any
polynomial. Notice that $P(\xi)^{-1/2}$ is $C^\infty$ (see, e.g., [20]) so its Fourier coefficients decay faster
than the inverse of any polynomial. The factor


Eii km 2 

in (7.5) is a quotient of two trigonometric polynomials with no pole on the unit circle, so its
Fourier coefficients decay exponentially, as can be seen from its Laurent expansion. □

By combining Corollary 7.3 and Proposition 7.6 we get 

Corollary 7.8. Let $X(\Psi)$, $X(\tilde\Psi)$ be an MRA-based wavelet bi-frame system with combined
mask $\tau = (\tau_0, \tau_1, \ldots, \tau_L)$ and $\tilde\tau$, and let $\phi$ and $\psi$ be respectively the scaling function
generated by $\tau_0$ and the associated standard orthonormal wavelet. Suppose that

• each filter $\tau_\ell$, $0 \le \ell \le L$, is a trigonometric polynomial;

• $\sum_{\ell=1}^{L} |\tau_\ell(\xi)|^2 > 0$ for $\xi \ne 0$;

• $\phi$ is an $r$-regular scaling function (not necessarily orthonormal);

• $\mathrm{VM}(\Psi) \le \mathrm{VM}(\psi)$.

Then, for $1 < p < \infty$, $0 < \tau < p$,

         $\mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}), X(\psi)) \hookrightarrow \mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}), X_2(\Psi))$.

In particular, for $1 < p < \infty$ and $0 < \alpha = 1/\tau - 1/p < r$, we have the Jackson inequality

         $\sigma_m(f, X_2(\Psi))_p \le C m^{-\alpha} \|f\|_{B^\alpha_\tau(L_\tau(\mathbb{R}))}$.

Proof. Using Proposition 7.6, we get the sparse expansion coefficients which make it possible
to apply Lemma 7.2 to obtain $\mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}), X(\psi)) \hookrightarrow \mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}), X_2(\Psi))$. The second
claim follows from Corollary 7.3 using the fact that $\mathcal{K}_{\tau,\tau}(L_p(\mathbb{R}), X(\psi)) = B^\alpha_\tau(L_\tau(\mathbb{R}))$,
$\alpha \in (0, r)$, $\alpha = 1/\tau - 1/p$, see e.g. [10]. □

Remark 7.9. In particular, Corollary 7.8 applies to any tight framelet system $\Psi$ with
$\mathrm{VM}(\Psi) = 1$. For example, it applies to tight framelet systems constructed using the UEP
based on the B-splines [8, 15].
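
As an illustration of the hypotheses involved, the following sketch checks them numerically for one concrete tight framelet mask built on the piecewise linear B-spline. The masks used below, $\tau_0(\xi) = \cos^2(\xi/2)$, $\tau_1(\xi) = i\frac{\sqrt{2}}{2}\sin\xi$ and $\tau_2(\xi) = -\sin^2(\xi/2)$, are a commonly quoted UEP example and are an assumption of this sketch rather than something stated in the present paper; the function names are hypothetical. The code verifies the two UEP identities, the positivity of $|\tau_1|^2 + |\tau_2|^2$ away from $\xi = 0$, and its behaviour like $|\xi|^2$ near the origin, which is consistent with a single vanishing moment.

```python
import numpy as np

def linear_spline_uep_masks(xi):
    """Assumed UEP masks for the piecewise linear B-spline tight frame."""
    t0 = np.cos(xi / 2) ** 2
    t1 = 1j * (np.sqrt(2) / 2) * np.sin(xi)
    t2 = -np.sin(xi / 2) ** 2
    return t0, t1, t2

def framelet_energy(xi):
    """|tau_1(xi)|^2 + |tau_2(xi)|^2, the quantity required to be positive for xi != 0."""
    _, t1, t2 = linear_spline_uep_masks(xi)
    return np.abs(t1) ** 2 + np.abs(t2) ** 2

xi = np.linspace(-np.pi, np.pi, 2001)
masks = linear_spline_uep_masks(xi)
shifted = linear_spline_uep_masks(xi + np.pi)

# First UEP identity: the squared moduli of all masks sum to one.
uep_sum = sum(np.abs(t) ** 2 for t in masks)
# Second UEP identity: sum of tau_l(xi) * conj(tau_l(xi + pi)) vanishes.
uep_cross = sum(t * np.conj(ts) for t, ts in zip(masks, shifted))

print(np.allclose(uep_sum, 1.0), np.allclose(uep_cross, 0.0))
print(framelet_energy(1e-3) / 1e-6)  # ratio to xi^2 near the origin
```

Since $|\tau_1|^2 + |\tau_2|^2 = 1 - \cos^4(\xi/2)$ for these masks, the sum vanishes only at $\xi = 0$ in $[-\pi, \pi]$ and behaves like $\xi^2/2$ there.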


1 By $F \asymp G$ we mean that there exist two constants $0 < c \le C < \infty$ such that $cF \le G \le CF$.

8. Conclusion

In this paper we have studied approximation with wavelet bi-frame systems in $L_p(\mathbb{R}^d)$,
$1 < p < \infty$, and we have characterized the associated approximation spaces $\mathcal{A}^\alpha$ and
shown that they are essentially Besov spaces $B^{d\alpha}_\tau(L_\tau(\mathbb{R}^d))$, with $\alpha = 1/\tau - 1/p$. The
characterization holds true for the smoothness parameter $\alpha$ in a certain range depending
on the number of vanishing moments for the bi-frame system. It is also shown that for a
function $f$ in a Besov space with smoothness parameter in this range, the corresponding
canonical $L_p(\mathbb{R}^d)$-normalized bi-frame expansion of $f$ is sparse in the sense that the frame
coefficients are contained in $\ell_\tau$. Moreover, the rate of best $m$-term approximation to $f$ is
obtained simply by thresholding the canonical expansion.

For twice oversampled univariate wavelet bi-frames, we give a complete characterization
of the approximation spaces in terms of the Besov spaces $B^\alpha_\tau(L_\tau(\mathbb{R}))$. The characterization
holds true even for systems with few vanishing moments, and there is no restriction on the
smoothness parameter $\alpha$ except the natural requirement that $\alpha$ is less than the smoothness
of the generators of the wavelet frame. To obtain a characterization for wavelet bi-frames
with few vanishing moments, we prove that there exists a “nice” orthonormal wavelet with
a highly sparse expansion in the framelet system. This fact is then used to show that
smooth functions in $B^\alpha_\tau(L_\tau(\mathbb{R}))$ have sparse expansions in the twice oversampled wavelet
bi-frame system with expansion coefficients in $\ell_\tau$.

There is one fundamental difference between the sparse expansions obtained for systems
with many vanishing moments and for the twice oversampled wavelet bi-frames. When
we have enough vanishing moments, we can use the canonical frame expansion and it
will be sparse for smooth functions. For wavelet bi-frames with few vanishing moments
we no longer use the canonical expansion, but show that a smooth function has another
sparse expansion in the twice oversampled system, obtained through an expansion of the
function in an orthonormal wavelet.


Appendix A. Stability of wavelet bi-frame systems in $L_p(\mathbb{R}^d)$

In this appendix we study stability of wavelet bi-frame systems in $L_p(\mathbb{R}^d)$. Theorem A.1
below will show that we can characterize the $L_p(\mathbb{R}^d)$-norms by the analysis coefficients
associated with the bi-frame. Theorem A.3 will show that there is a stable way to
reconstruct $L_p(\mathbb{R}^d)$-functions using the bi-frame expansion. The main application of the
stability result in this paper is to prove the Jackson inequality for the bi-frame system
(Proposition 6.2). The technique to obtain the various characterizations in this appendix
is similar to the technique introduced by the authors in [3].

Below we let $D$ denote the set of dyadic intervals $I = 2^{-j}([0,1]^d + k)$, $j \in \mathbb{Z}$, $k \in \mathbb{Z}^d$,
$\psi_I := \psi_{j,k}$, and $\chi_I$ denotes the indicator function for $I$.

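Theorem A.1. Let $X(\Psi)$, $X(\tilde\Psi)$ be a wavelet bi-frame system. Suppose for all $\psi \in \Psi \cup \tilde\Psi$
there are $\beta > 0$ and $\varepsilon > 0$ such that $\psi \in C^\beta(\mathbb{R}^d)$, and

         $|\psi(x)| \le C(1 + |x|)^{-d-\varepsilon}$.

Then

(A.1)    $\Big\| \Big( \sum_{I \in D,\ \ell \in E} |\langle f, \psi^\ell_I\rangle|^2 |I|^{-1}\chi_I \Big)^{1/2} \Big\|_p \asymp \|f\|_p \asymp \Big\| \Big( \sum_{I \in D,\ \ell \in E} |\langle f, \tilde\psi^\ell_I\rangle|^2 |I|^{-1}\chi_I \Big)^{1/2} \Big\|_p$

for $1 < p < \infty$, where $E = \{1, 2, \ldots, L\}$.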

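Proof. Let $\{\eta^i\}_{i=1}^{2^d-1}$ be the orthonormal Meyer wavelet(s) defined on $\mathbb{R}^d$. For each $\ell \in E$
we consider the integral kernel

         $K^\ell(x, y) := \sum_{I \in D} \eta^1_I(x)\, \overline{\psi^\ell_I(y)}$.

Notice that the corresponding operator

         $T^\ell f(x) := \int_{\mathbb{R}^d} K^\ell(x, y) f(y)\, dy$

is bounded on $L_2(\mathbb{R}^d)$ due to the fact that $\{\psi^\ell_I\}_{I \in D}$ is a subset of a frame. Also, standard
estimates show that (see e.g. [7])

         $|K^\ell(x, y)| \le C|x - y|^{-d}$,

         $|K^\ell(x', y) - K^\ell(x, y)| \le C|x - x'|^{\alpha}|x - y|^{-d-\alpha}$,

and

         $|K^\ell(x, y') - K^\ell(x, y)| \le C|y - y'|^{\alpha}|x - y|^{-d-\alpha}$,

because of the smoothness and decay of $\psi^\ell$. Thus $T^\ell$ is a Calderón-Zygmund operator
and therefore bounded on $L_p(\mathbb{R}^d)$, $1 < p < \infty$. However $T^\ell f$ has a nice expansion in the
orthonormal Meyer wavelet, so using the $L_p(\mathbb{R}^d)$-characterization of such expansions we
get

         $\Big\| \Big( \sum_{I \in D} |\langle f, \psi^\ell_I\rangle|^2 |I|^{-1}\chi_I \Big)^{1/2} \Big\|_p \asymp \|T^\ell f\|_p \le C\|f\|_p$.

Using this estimate for $\ell = 1, 2, \ldots, L$, and the fact that $\ell_1 \hookrightarrow \ell_2$ we get

         $\Big\| \Big( \sum_{I \in D,\ \ell \in E} |\langle f, \psi^\ell_I\rangle|^2 |I|^{-1}\chi_I \Big)^{1/2} \Big\|_p = \Big\| \Big( \sum_{\ell \in E} \Big\{ \Big( \sum_{I \in D} |\langle f, \psi^\ell_I\rangle|^2 |I|^{-1}\chi_I(x) \Big)^{1/2} \Big\}^2 \Big)^{1/2} \Big\|_p$

                 $\le \Big\| \sum_{\ell \in E} \Big( \sum_{I \in D} |\langle f, \psi^\ell_I\rangle|^2 |I|^{-1}\chi_I(x) \Big)^{1/2} \Big\|_p$

                 $\le L \cdot C\|f\|_p$.

Using a similar argument applied to the frame $X(\tilde\Psi)$, we may conclude that

         $\Big\| \Big( \sum_{I \in D,\ \ell \in E} |\langle f, \tilde\psi^\ell_I\rangle|^2 |I|^{-1}\chi_I(x) \Big)^{1/2} \Big\|_p \le L \cdot C\|f\|_p$.
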
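Now we turn to the converse estimate. Notice that since we have a bi-frame we have
the identity

         $\langle f, g\rangle = \sum_{I \in D,\ \ell \in E} \langle f, \psi^\ell_I\rangle\, \overline{\langle g, \tilde\psi^\ell_I\rangle}, \qquad f, g \in L_2(\mathbb{R}^d)$.

Write

         $Wf(x) = \{ |I|^{-1/2}\langle f, \psi^\ell_I\rangle \chi_I(x) \}_{I,\ell}, \qquad \tilde W g(x) = \{ |I|^{-1/2}\langle g, \tilde\psi^\ell_I\rangle \chi_I(x) \}_{I,\ell}$,

and notice that for $f \in L_2(\mathbb{R}^d) \cap L_p(\mathbb{R}^d)$ and $g \in L_2(\mathbb{R}^d) \cap L_{p'}(\mathbb{R}^d)$, with $p^{-1} + (p')^{-1} = 1$,

(A.2)    $|\langle f, g\rangle| = \Big| \int \langle Wf(x), \tilde W g(x)\rangle_{\ell_2}\, dx \Big|$

                 $\le \int \|Wf(x)\|_{\ell_2} \|\tilde W g(x)\|_{\ell_2}\, dx$

                 $\le \big\| \|Wf(x)\|_{\ell_2} \big\|_p\, \big\| \|\tilde W g(x)\|_{\ell_2} \big\|_{p'}$

                 $\le C \big\| \|Wf(x)\|_{\ell_2} \big\|_p \|g\|_{p'}$  and also  $\le C \big\| \|\tilde W g(x)\|_{\ell_2} \big\|_{p'} \|f\|_p$.

Taking the supremum of the estimate (A.2) for $\{ g \in L_2(\mathbb{R}^d) \cap L_{p'}(\mathbb{R}^d) : \|g\|_{p'} \le 1 \}$ we
obtain

         $\|f\|_p \le C \big\| \|Wf(x)\|_{\ell_2} \big\|_p$.

This proves the result for $f \in L_2(\mathbb{R}^d) \cap L_p(\mathbb{R}^d)$. To complete the proof for $f \in L_p(\mathbb{R}^d)$
we just notice that from the first part of the proof it follows that $f \mapsto \|Wf(x)\|_{\ell_2}$ is
continuous on $L_p(\mathbb{R}^d)$.

Using (A.2) once more, taking the supremum for $\{ f \in L_2(\mathbb{R}^d) \cap L_p(\mathbb{R}^d) : \|f\|_p \le 1 \}$ we
obtain $\|g\|_{p'} \le C \big\| \|\tilde W g(x)\|_{\ell_2} \big\|_{p'}$. □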


From Theorem A.1 we see that the following sequence space plays an important role.

Definition A.2. Let $d_p$ denote the set of sequences $\{c^\ell_I\}_{I \in D,\ \ell \in E}$ for which

         $\|\{c^\ell_I\}\|_{d_p} := \Big\| \Big( \sum_{I \in D,\ \ell \in E} |c^\ell_I|^2 |I|^{-1}\chi_I \Big)^{1/2} \Big\|_p < \infty$.


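In fact, let us show that there is a stable reconstruction operator defined on $d_p$ for
bi-frame systems.

Theorem A.3. Let $X(\Psi)$ and $X(\tilde\Psi)$ be a bi-frame for $L_2(\mathbb{R}^d)$. Suppose for all $\psi \in \Psi \cup \tilde\Psi$
there exist $\beta > 0$ and $\varepsilon > 0$ such that $\psi \in C^\beta(\mathbb{R}^d)$ and $|\psi(x)| \le C(1 + |x|)^{-d-\varepsilon}$. Then the
maps $T : d_p \to L_p(\mathbb{R}^d)$ and $\tilde T : d_p \to L_p(\mathbb{R}^d)$ defined by

         $T\{c^\ell_I\} = \sum_{I \in D,\ \ell \in E} c^\ell_I \psi^\ell_I, \qquad \tilde T\{c^\ell_I\} = \sum_{I \in D,\ \ell \in E} c^\ell_I \tilde\psi^\ell_I$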
are both bounded linear maps. 
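Proof. We consider the operator $U^\ell$ with kernel

         $K^\ell(x, y) := \sum_{I \in D} \psi^\ell_I(x)\, \overline{\eta^1_I(y)}$.

By exactly the same arguments as given in the first part of the proof of Theorem A.1,
it can be shown that $U^\ell$ is bounded on $L_p(\mathbb{R}^d)$. Take $\{c^\ell_I\}_{I \in D,\ \ell \in E} \in d_p$ and consider
$f^\ell = \sum_{I \in D} c^\ell_I \eta^1_I$. This is a well-defined function in $L_p(\mathbb{R}^d)$ with

         $\|f^\ell\|_p \asymp \Big\| \Big( \sum_{I \in D} |c^\ell_I|^2 |I|^{-1}\chi_I \Big)^{1/2} \Big\|_p$,

where we used the characterization of $L_p(\mathbb{R}^d)$ using wavelets. Thus,

         $\Big\| \sum_{I \in D,\ \ell \in E} c^\ell_I \psi^\ell_I \Big\|_p \le \sum_{\ell \in E} \Big\| \sum_{I \in D} c^\ell_I \psi^\ell_I \Big\|_p = \sum_{\ell \in E} \|U^\ell f^\ell\|_p$

                 $\le C \sum_{\ell \in E} \|f^\ell\|_p \le C \sum_{\ell \in E} \Big\| \Big( \sum_{I \in D} |c^\ell_I|^2 |I|^{-1}\chi_I \Big)^{1/2} \Big\|_p$

                 $\le L \cdot C \Big\| \Big( \sum_{I \in D,\ \ell \in E} |c^\ell_I|^2 |I|^{-1}\chi_I \Big)^{1/2} \Big\|_p$,

and it follows that $T : d_p \to L_p(\mathbb{R}^d)$ is bounded. The claim for $\tilde T$ is proved using similar
arguments with appropriately modified operator kernels. □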

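Recall the Lorentz space $\ell_{p,q}(\Lambda)$, $1 < p < \infty$, $0 < q \le \infty$, for some countable set $\Lambda$, as
the set of sequences $\{a_m\}_{m \in \Lambda}$ satisfying $\|\{a_m\}\|_{\ell_{p,q}} < \infty$, where

(A.3)    $\|\{a_m\}\|_{\ell_{p,q}} := \Big( \sum_{j=0}^{\infty} \big( 2^{j/p} a^*_{2^j} \big)^q \Big)^{1/q}$ for $0 < q < \infty$,
         $\|\{a_m\}\|_{\ell_{p,q}} := \sup_{j \ge 0} 2^{j/p} a^*_{2^j}$ for $q = \infty$,

with $\{a^*_j\}_{j=0}^{\infty}$ a decreasing rearrangement of $\{a_m\}_{m \in \Lambda}$.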
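
For a finitely supported sequence the quantity in (A.3) is straightforward to evaluate. The sketch below assumes the dyadic form $\big(\sum_{j \ge 0} (2^{j/p} a^*_{2^j})^q\big)^{1/q}$, with the supremum for $q = \infty$, and it assumes that the rearrangement is indexed from 1, so that $a^*_1$ is the largest entry in absolute value; both the indexing convention and the function name `lorentz_norm` are assumptions of this illustration.

```python
import numpy as np

def lorentz_norm(a, p, q):
    """Dyadic Lorentz (quasi-)norm of a finite sequence, following (A.3).
    Assumes the decreasing rearrangement a*_1 >= a*_2 >= ... is 1-indexed."""
    a_star = np.sort(np.abs(np.asarray(a, dtype=float)))[::-1]
    if a_star.size == 0:
        return 0.0
    # Dyadic positions 1, 2, 4, ... that fall inside the (finite) support.
    js = np.arange(int(np.floor(np.log2(a_star.size))) + 1)
    terms = 2.0 ** (js / p) * a_star[2 ** js - 1]
    if np.isinf(q):
        return float(np.max(terms))
    return float(np.sum(terms ** q) ** (1.0 / q))

a = [1.0, -4.0, 3.0, 2.0]
print(lorentz_norm(a, p=1.0, q=1.0), lorentz_norm(a, p=1.0, q=np.inf))
```

For the sample sequence the rearrangement is $(4, 3, 2, 1)$, the dyadic samples are $a^*_1 = 4$, $a^*_2 = 3$, $a^*_4 = 1$, and the weighted terms are $4$, $6$, $4$.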

It can be shown that there exist constants $c, C > 0$ such that

(A.4)    $c\|\{c^\ell_I\}\|_{\ell_{p,\infty}(D \times E)} \le \Big\| \Big( \sum_{I \in D,\ \ell \in E} |c^\ell_I|^2 |I|^{-2/p}\chi_I \Big)^{1/2} \Big\|_p \le C\|\{c^\ell_I\}\|_{\ell_{p,1}(D \times E)}$,

for any $\{c^\ell_I\} \in \ell_{p,1}(D \times E)$, see e.g. [3].

From the above results we easily deduce the following important corollary. As before,
we denote by $\psi^{\ell,p}_I$ the function $\psi^\ell_I$ normalized in $L_p(\mathbb{R}^d)$, i.e. $\psi^{\ell,p}_I := |I|^{1/2 - 1/p}\psi^\ell_I$.

Corollary A.4. Let $X(\Psi)$ and $X(\tilde\Psi)$ be a wavelet bi-frame for $L_2(\mathbb{R}^d)$. Suppose for all
$\psi \in \Psi \cup \tilde\Psi$ there exist $\beta > 0$ and $\varepsilon > 0$ such that $\psi \in C^\beta(\mathbb{R}^d)$ and $|\psi(x)| \le C(1 + |x|)^{-d-\varepsilon}$.
Then $X(\Psi)$ and $X(\tilde\Psi)$ are $\ell_{p,1}$-hilbertian systems in $L_p(\mathbb{R}^d)$, $1 < p < \infty$, that is to say
we have

         $\Big\| \sum_{I \in D,\ \ell \in E} c^\ell_I \psi^{\ell,p}_I \Big\|_p \le C_p \|\{c^\ell_I\}\|_{\ell_{p,1}(D \times E)}$,

         $\Big\| \sum_{I \in D,\ \ell \in E} c^\ell_I \tilde\psi^{\ell,p}_I \Big\|_p \le C_p \|\{c^\ell_I\}\|_{\ell_{p,1}(D \times E)}$,

for any sequence $\{c^\ell_I\} \in \ell_{p,1}(D \times E)$.

Proof. Follows from Theorem A.3 and equation (A.4). □

This property is used to prove a Jackson inequality in Section 6.